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ISOLATED GENES FROM VIRULENT GROUP B STREPTOCOCCUS 

AGALACTIAE 

1. F IE LD OF TH E INVENT I ON 

The present invention relates to genes isolated from Group B streptococci 
("GBS"). More specifically, the present invention relates to genes that are specific to 
virulent forms of GBS and methods of using such genes and their products for the 
diagnosis and treatment of GBS infections. 

2. TECH NI CAL BACKGROUND 

Group B streptococci ("GBS") are a common cause of disease in newborns, 
pregnant women, and other persons. Common manifestations of these infections include 
bacteremia, pneumonia, meningitis, endocarditis, and osteoarticular infections. C.J. 
Baker & M.S. Edwards, Group B Streptococcal Infections, in Infectious Disease of the 
Fetus and Newborn Infant, 980-1054 (J.S. Remington & J.O. Klein, eds., 1995); P. 
Munoz et al., Arch Int Med 157:213-216 (1997). 

Nearly 3 out of every 1,000 children born are infected with an invasive form of 
GBS disease. While GBS disease is of great concern in neonates, GBS is also an 
important pathogen in the general population, in which the incidence of invasive GBS 
disease is nearly 8 in 100,000. Of these infections, the mortality rate can be as high as 
30%. 

During childbirth, GBS can pass from the mother to the newborn. By one 
estimate, up to 30% of pregnant women carry GBS at least temporarily in the vagina or 
rectum without symptoms. Infants born to these women become colonized with GBS 
during delivery. Baker & Edwards, supra. Aspiration of infected amniotic fluid or 
vaginal secretions allow GBS to gain access to the lungs. Adhesion to, and invasion of, 
respiratory epithelium and endothelium appear to be critical factors in early onset 
neonatal infection. Baker & Edwards, supra; C.E. Rubens et al., J Inf Dis 164:320-330 
(1991). Subsequent steps in infection, such as blood stream invasion and the 
establishment of metastatic local infections have not been clarified. The pathogenesis of 
neonatal infection occurring after the first week of life is also not well understood. 
Gastrointestinal colonization may be more important than a respiratory focus in late onset 
neonatal disease. Baker & Edwards, supra. Considerable evidence suggests that invasion 
of brain microvascular endothelial cells by GBS is the initial step in the pathogenesis of 
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meningitis. GBS are able to invade human brain microvascular endothelial cells and type 
III GBS, which are responsible for the majority of meningitis, accomplish this 2-6 times 
more efficiently than other serotypes. V. Nizet et aL Infect Immun 65:5074-5081 (1997). 

Because GBS is widely distributed among the population and is an important 
pathogen in newborns, pregnant women are commonly tested for GBS at 26 to 28 weeks 
5 of pregnancy. Much of GBS neonatal disease is preventable by administration of 

prophylactic antibiotics during labor to women who test positive or display known risk 
factors. However, these antibiotics programs do not prevent all GBS disease. The 
programs are deficient for a number of reasons. First, the programs can be inefficient. 
Second, it is difficult to ensure that all healthcare providers and patients comply with the 

10 testing and treatment. And finally, if new serotypes or antibiotic resistance emerges, the 
antibiotic programs may fail altogether. Currently available tests for GBS are inefficient. 
These tests may provide false negatives. Furthermore, the tests are not specific to virulent 
strains of GBS. Thus, antibiotic treatment may be given unnecessarily and add to the 
problem of antibiotic resistance. Although a vaccine would be advantageous, none are 

1 5 yet commercially available. 

Traditionally, GBS have been divided into 9 serotypes according to the 
immunologic reactivity of the polysaccharide capsule. H.M. Blumberg et aL, JInfDis 
173: 365-373 (1996). Serotype III GBS cause 60-70% of all infections and almost all 
meningitis. Baker & Edwards, supra. Type III GBS can be subdivided into three groups 

20 of related strains based on the analysis of restriction digest patterns (RDPs) produced by 
digestion of chromosomal DNA with Hind III and SseS3S7 I. Y. Nagano et aL, J Med 
Micro 35:297-303 (1991); S. Takahashi et aL, JInfDis 177:1 1 16-1 119 (1998). Figure 1 
illustrates a comparison of Hind III and &e8387 I RDP typing of 62 type III isolates from 
Salt Lake City, Utah and Tokyo, Japan. Isolates were classified into types based on the 

25 similarity of the restriction digest patterns produced by Hind III or SseS387 I digestion of 
chromosomal DNA. The two methods divided the isolates into RDP types containing 
exactly the same isolates: III-3 contains isolates 1-41, II-2 contains isolates 42-59, and II- 
1 contains isolates 60-62. 

Over 90% of invasive type III GBS neonatal disease in Tokyo, Japan and in Salt 

30 Lake City, Utah is caused by bacteria from one of three RDP types, termed RDP type III- 
3, while RDP type III-2 are significantly more likely to be isolated from vagina than from 
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blood or CSF. These results suggest that this genetically-related cluster of type III-3 GBS 
are more virulent than III-2 strains and could be responsible for the majority of invasive 
type III disease globally. 

From the foregoing, it will be appreciated that it would be a significant 
advancement in the art to provide one or more markers that are specific to virulent type 
III-3 GBS. It would be a further advancement to provide a method to exploit these 
markers for clinical identification of virulent type III-3 GBS. It would be a further 
advancement to provide methods for producing vaccines against type III-3 GBS. 

Such compositions and methods are disclosed herein. 



10 3. BRIEF SUMM ARY O F TH E INVENT IO N 

The present invention relates to markers specific to type III-3 GBS. These 
markers, the spbl and spb2 gene products (SEQ ID NO: 2 and SEQ ID NO: 4, 
respectively), are encoded by the spbl (SEQ ID NO: 1) and spb2 (SEQ ID NO: 3) genes. 
The invention also provides these genes and gene products in substantially purified form. 

15 In certain other embodiments, the present invention relates to recombinant vectors 

which incorporate the spbl gene or other nucleic acid molecules that code for the spbl 
gene product. The recombinant vector may be a plasmid. In certain embodiments, the 
recombinant vector is a prokaryotic or eukaryotic expression vector. In certain preferred 
embodiments, the nucleic acid molecule is operably linked to a heterologous promoter 

20 and/or other expression control elements, such as heterologous enhancers and 
polyadenylation sequences. 

In certain other embodiments, the present invention relates to recombinant vectors 
which incorporate the spbl gene or other nucleic acid molecules that code for the spb2 
gene product. The recombinant vector may be a plasmid. In certain embodiments, the 

25 recombinant vector is a prokaryotic or eukaryotic expression vector. In certain preferred 
embodiments, the nucleic acid molecule is operably linked to a heterologous promoter 
and/or other expression control elements. 

The present invention also provides host cells comprising the spbl and/or spb2 
genes. In other embodiments, a host cell of the present invention comprises nucleic acid 

30 molecules that code for the spbl and/or spb2 gene products. The host cell may be a 
prokaryotic or eukaryotic host cell. 
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The present invention also relates to diagnostic methods for determining whether 
a mammal is infected or colonized by virulent GBS. In certain embodiments, a diagnostic 
method comprises the steps of (1) collecting a bodily fluid or culture from the mammal 
and (2) analyzing the bodily fluid or culture for the presence of absence of one or more 
gene products specific to type III-3 GBS, wherein the presence of one or more gene 
5 products specific to type III-3 GBS indicates infection or colonization by virulent GBS. 
The mammal may be a human. Alternatively, the mammal may be a laboratory, 
domestic, or agricultural animal. The bodily fluid or culture may be any bodily fluid or 
culture that is typically analyzed for the presence of bacteria. For example, the bodily 
fluid or culture may be a vaginal or rectovaginal culture. The bodily fluid or culture may 

10 also be a throat culture. The bodily fluid or culture may also be an endotracheal tube 
aspirant, fluid from a brochioaveloar lavage, or tissue from a lung biopsy. In certain 
embodiments, the bodily fluid or culture is blood, serum, amniotic fluid, cerebrospinal 
fluid, or joint fluid. Other sources of material will be apparent to those of skill in the art. 
In certain embodiments, a diagnostic method of the present invention comprises 

15 analyzing a sample for the presence or absence of the spbl and/or spb2 gene product(s). 

In certain embodiments of a diagnostic method of the present invention, the 
polymerase chain reaction ("PCR") is used to identify the presence or absence of the spbl 
and/or spbl gene(s). In certain other embodiments, antibodies are used to identify the 
presence or absence of the spbl and/or spb2 gene products. The antibodies may be 

20 monoclonal or polyclonal antibodies. 

The present invention also relates to GBS vaccines. In certain embodiments, the 
present invention provides vaccines comprising the spbl gene product, i.e., a protein 
comprising the amino acid sequence of SEQ ID NO: 2. In certain other embodiments, a 
vaccine comprises the spb2 gene product, i.e., a protein comprising the amino acid 

25 sequence of SEQ ID NO: 4. In certain preferred embodiments, a vaccine comprises both 
the spbl and spb2 gene products. The vaccine may include an adjuvant, such as alum. In 
certain other embodiments, the spbl and/or spb2 gene(s) may be introduced into a 
mammal using either naked DNA or other gene therapy techniques to induce an immune 
response against type III GBS. 

30 The present invention further provides methods of immunizing a mammal against 

GBS infection. In certain embodiments, such methods comprise administering to the 
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mammal a vaccine comprising an immunologically effective amount of a recombinantly 
produced protein comprising the amino acid sequence of SEQ ID NO: 2. In certain other 
embodiments, a method of the present invention comprises administering to the mammal 
a vaccine comprising an immunologically effective amount of a recombinantly produced 
protein comprising the amino acid sequence of SEQ ID NO: 4. The vaccine may also 
5 contain a mixture of the spbl and spb2 gene products. Vaccines used in the methods of 
the present invention may further comprise an adjuvant, such as alum. 

These and other features and advantages of the present invention will become 
more fully apparent from the following detailed description. 



10 4. SU MM ARY OF TH E DRA WING S 

Figure 1 illustrates a comparison of Hindll! and SseS3S71 RDP typing of 62 type 
III GBS isolates from Salt Lake City, Utah and Tokyo, Japan. Isolates were classified 
into types based on the similarity of the restriction digest patterns produced by Hindlll or 
&e83871 digestion of chromosomal DNA. The two methods divided isolates into RDP 

15 types containing exactly the same isolates: III-3 contains isolates 1-41, III-2 contains 
isolates 42 - 59, and III-l contains isolates 60 - 62. 

Figure 2 illustrates a dot blot hybridization of probe 1 with genomic DNA isolated 
from type III GBS. 10 |ig of genomic DNA from each of 62 type III GBS strains was 
transferred to nylon membrane. Radiolabeled probe 1 hybridized with DNA from all III- 

20 3 strains (rows A - D) including the original type III-3 strain (well El). The probe failed 
to hybridize with DNA from III-2 strains (Fl - F10, Gl - G7) including the original strain 
used in the subtraction hybridization (well E10) and III-l strains (wells HI - H3). The 
same pattern of hybridization was observed using clone 3 and 1 1 probes. 



25 5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to the identification and prevention of infections by 
virulent forms of GBS. The present invention also relates to isolated genes specific to 
type III-3 GBS. These genes, spbl and spbl, encode the spbl and spb2 gene products. 

The spbl and spb2 genes or other nuclceic acid moleucles coding for the spbl or 
30 spb2 gene products may be incorporated into a recombinant vector using methods known 
in the art. See, e.g., 1-3 J. Sambrook et al., Molecular Cloning: A Laboratory Manual 
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(2d ed. 1989). Recombinant vectors include any genetic element, such as a plasmid, 
phage, transposon, cosmid, chromosome, virus, etc., that is capable of replication when 
associated with the proper control elements and that can transfer gene sequences between 
cells. Thus, the term includes cloning and expression vehicles. 

A nucleic acid molecule of the present invention may be operably linked to 
5 expression control sequences, such as heterologous promoters. Examples include, but are 
not limited to, viral promoters such as the SV40 early promoter and the CMV immediate 
early promoter region, bacterial promoters, mammalian promoters, inducible promoters, 
synthetic promoters, hybrid promoters, and the like. Other expression control sequences 
are known in the art and include polyadenylation signals, transcription termination 

10 sequences, upstream regulatory domains, origins of replication, internal ribosome entry 
sites ("IRES"), and enhancers. These expression control sequences collectively provide 
for the replication, transcription and translation of a coding sequence in a recipient cell. 
Not all of these control sequences need always be present in a recombinant vector, so 
long as the selected coding sequence is capable of being replicated, transcribed and 

15 translated in an appropriate host cell. 

Recombinant vectors can be constructed to include selectable markers. Suitable 
markers include genes which confer antibiotic resistance or sensitivity, or impart color, or 
change the antigenic characteristics when host cells which have been transfected with the 
recombinant vectors are grown in an appropriate selective medium. Suitable markers are 

20 known to those of skill in the art. 

The discovery of type III-3 GBS-specific gene products will allow clinicians to 
diagnose and treat infection and colonization with virulent GBS. For example, 
hybridization-based assays may be used to determine whether a GBS isolate is type III-3. 
Figure 2 illustrates the results of hybridization assays with a III-3-specific probe. 

25 Likewise, PCR may be used to detect the presence or absence of either the spbl gene or 
the spb2 gene (or both) in samples from patients. PCR methods are described generally 
in C.R. Newton & A. Graham, PCR (2nd. ed. 1997); PCR: Essential Techniques (J.F. 
Burke ed., 1996). Patients who are infected with type III-3 GBS may then receive 
appropriate antibiotic therapy. 
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Antibodies may also be used to detect the presence or absence of the spbl and/or 
spb2 gene product(s). Methods for preparing both monoclonal and polyclonal antibodies 
are described in, e.g., E. Harlow & D. Lane, Antibodies: A Laboratory Manual (1988). 

The present invention also relates to methods for producing type III-3 GBS 
vaccines. See generally Vaccine Protocols (A. Robinson, G.H. Farrar & C.N. Wiblin eds. 
5 1996). In certain embodiments, the spbl and/or spb2 gene product(s) may be used to 

immunize against GBS. These gene products may be produced in large quantities using 
techniques that are known in the art. For example, the appropriate gene or genes may be 
linked to a prokaryotic promoter and expressed in bacteria. The gene products may then 
be purified using conventional techniques and used to vaccinate at-risk individuals. 

1 0 Alternatively, the appropriate gene or genes may be linked to a eukaryotic promoter and 
enhancer (e.g., yeast, baculovirus, SV40, etc.) and expressed in an appropriate cell type. 
The gene products may then be purified using conventional techniques. 

The spbl and/or spb2 gene products, or immunogenic fragments thereof, may 
stimulate an immune response when administered to a host. Recombinantly produced 

15 proteins are especially desirable, as they can be produced in large amounts and purified. 
Furthermore, recombinantly produced proteins may be engineered to maximize desirable 
activities and to minimize unwanted effects. 

The recombinantly produced spbl and/or spb2 gene products may be used as 
carrier proteins for a polysaccharide-protein or oligosaccharide-protein conjugate vaccine. 

20 See, e.g., R. Schneerson, et al., Infect Immun 60:3528-3532 (1992) (describing a 

Pneumococcus -pertussis toxin conjugate vaccine). For example, Haemophilus influenzae 
B vaccines have been produced by conjugating a tetanus toxoid; a Corynebacteriaum 
toxin, CRM 197 (which is a mutant diptheria toxin); and a Neisseria outer membrane 
protein. Oligo- and polysaccharides from GBS might be used in a vaccine. 

25 Oligosaccharide- and polysaccharide-protein conjugates alter the immunological 
properties of the polysaccharide or oligosaccharide and may improve the immune 
response. 

An adjuvant may be used to enhance the immune response to a vaccine containing 
the spbl and/or spb2 gene products. An adjuvant is any substance that enhances the 
30 immune response to an antigen. Without being bound by any particular theory, adjuvants 
may act by retaining the antigen locally near the site of administration to produce a depot 
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effect, facilitating the slow, sustained release of the antigen to cells of the immune 
system. Adjuvants may also attract cells of the immune system. Aluminum hydroxide 
and aluminum phosphate (collectively and commonly referred to as "alum") are routinely 
used as adjuvants in human and veterinary vaccines. Currently, alum is the only adjuvant 
licensed for human use, although a number of experimental adjuvants are being tested. 
5 The spbl and/or spb2 gene(s) may also be introduced into a mammal using either 

naked DNA or other gene therapy techniques to induce an immune response against 
virulent GBS. 

All publications, patents, and patent applications cited in this application are 
hereby incorporated by reference. U.S. Patent Application Serial No. 60/140,084 is 
10 hereby incorporated by reference in its entirety. 

6. EXA M PLES 

The following examples are given to illustrate several embodiments which have 
been made within the scope of the present invention. It is to be understood that these 

15 examples are neither comprehensive nor exhaustive of the many types of embodiments 
which can be prepared in accordance with the present invention. 

Example 1 - Isolation of the spbl and spb2 genes 
Bacterial factors that contribute to the increased virulence of III-3 strains can be 
identified by characterizing the differences between the genetic composition of III-3 and 

20 III-2 strains. Such genetic differences will be found in the bacterial chromosomes since 
these strains do not contain plasmids. Takahashi et al., supra. 

To identify genes present in virulent type III-3 GBS strains and not in the 
avirulent type III-2 strains, a modification of the technique described by Lisitsyn et al., 
Science 259:946-951 (1993), was used. High molecular weight genomic DNA from an 

25 invasive RDP type III-3 GBS strain (strain 874391) and a colonizing ("avirulent") RDP 
type III-2 strain (strain 865043) were prepared by cell lysis with mutanolysin and 
Proteinase K digestion. Y. Nagano et al., supra. For genetic subtraction, genomic DNA 
from both strains was digested with Taq I. Tag I-digested DNA from the virulent strain 
was mixed with two complementary oligonucleotides, TaqA (5'-CTAGGTGGA- 

30 TCCTTCGGCAAT-3 9 (SEQ ID NO: 5)) and TaqB (5'-CGATTGCCGA-3' (SEQ ID NO: 
6)), heated to 50° C for 5 minutes, then allowed to cool slowly to 16 °C in T4 ligase 

8 
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buffer. Oligonucleotides were ligated to the virulent strain DNA by incubation with 20 
units of T4 ligase at 16 °C for 12 hours. After ligation, 500 ng of DNA from the virulent 
strain, with ligated linkers, and 40 jig of DNA from the avirulent strain, without linkers, 
was mixed together, denatured by heating, and hybridized at 68 °C for 20 hours. 

Ten percent of the resulting hybridization mixture was incubated with Taq DNA 
5 polymerase and dNTPs to fill in the ends of annealed virulent strain DNA. The 

hybridized DNA was amplified by Taq DNA polymerase for 10 cycles using the TaqA 
oligonucleotide as the forward and reverse amplification primer. After amplification, 
single stranded products remaining after amplification were digested with mung bean 
nuclease. Twenty percent of the resulting product was then reamplified for 20 cycles. 

1 0 This process of subtraction followed by PCR amplification results in enhanced 

amplification of DNA segments from the III-3 strains that do not hybridize with DNA 
segments from the III-2 strains. 

A total of four cycles of subtraction and amplification were carried out using 
successively smaller quantities of III-3 specific PCR products. Two pairs of 

1 5 oligonucleotides were used for subtraction, which were alternated with successive rounds 
of subtraction-amplification. The oligos were TaqA (SEQ ID NO: 5) and TaqB (SEQ ID 
NO: 6) (the first pair) and TaqE (5 '-AGGCAACTGTGCTAACCGAGGGAAT-3 ' (SEQ 
ID NO: 7)) and TaqF (5'-CGATTCCCTCG-3' (SEQ ID NO: 8)) (the second pair). The 
final amplification products were ligated into pBS KS+ vector and transformed into 

20 competent XLl-Blue strain E. coli. Thirteen clones were randomly selected for analysis. 
Cross hybridization studies of the 13 inserts revealed that 6 were unique. These probes 
were used in slot and dot blot experiments to determine whether subtraction was 
successful and to identify probes hybridizing with all III-3 strains. Each of the 6 unique 
probes hybridized with the parental III-3 virulent strain, while none of the probes 

25 hybridized with the avirulent III-2 strains. Three of the amplified sequence tags (clones 
1, 3 and 11) hybridized with genomic DNA from all 62 type III-3 isolates, but did not 
hybridize with DNA prepared from the III-2 and III-l isolates. 

Figure 2 illustrates a dot blot hybridization of type III GBS genomic DNA 
hybridized with a clone 1 probe. 10 jag of genomic DNA from each of 62 type III GBS 

30 strains was transferred to nylon membrane. Radiolabeled clone 1 probe hybridized with 
DNA from all III-3 strains (rows A-D) including the original type III-3 strain (well E-l). 
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The probe failed to hybridize with DNA from III-2 strains (F1-F10, G1-G7) including the 
original strain used in the subtraction hybridization (well E10) and III-l strains (wells Hl- 
H3). The same pattern of hybridiztion was observed using clone 3 and 1 1 probes. These 
data demonstrate the feasibility of identifying genes unique to III-3 strains by this method 
of PCR-based sub tractive hybridization, and further support the validity of the RDP 
5 typing for identifying genetically-related type III GBS. 

The three GBS type III-3 -specific sequence tags are short (130-360 bp). To obtain 
additional sequence information, a genomic GBS III-3 library was constructed. High 
molecular weight GBS genomic DNA was partially digested with Bgl II and cloned into X 
FIX II phage vector. Phage were packaged and the library, consisting of 1 .7 x 10 5 

1 0 recombinant phage containing inserts with a mean size of about 1 8 kb (totaling 

approximately 3 x 10 9 bp), was amplified once. Multiple plaques hybridizing with each 
of the III-3 GBS-specific probes were purified for further characterization. 

Three overlapping genomic clones hybridizing with probe 1 were identified, with 
approximate sizes of 9, 22, and 23 kb. Since the boundaries of GBS III-3 specific DNA 

15 are not known, smaller fragments were subcloned and the DNA was verified present in 
virulent GBS strains before proceeding with further characterization. The first segment 
examined is a 6.4 kb Sal l-Bgl II fragment. This genomic DNA is present in all RDP type 
III-3 strains and in none of 17 RDP type III-2 strains. 

Over 90% of this genomic DNA fragment has been sequenced and found to 

20 contain 5 open reading frames ("ORFs"). Three of these are likely to be authentic genes. 
They feature ATG start sites, are preceded by potential ribosomal binding sites and, in 
two cases, stop codons are followed by palindromic sequences that may represent 
transcriptional terminators. They are relatively short, however, and do not show 
significant homology at the nucleic acid or amino acid level with sequences registered 

25 with Genbank or the available bacterial genome databases. 

The two other ORFs appear to be more obvious candidates for virulence genes. 
The spbl gene includes a 1509 bp ORF that is preceded by a potential ribosomal binding 
site 10 bases upstream from an ATG start codon. The predicted protein (502 amino acids 
and Mr 53,446) has the characteristics of a cell wall-bound protein. The N-terminus of 

30 the predicted protein is a hydrophilic, basic stretch of 6 amino acids followed by a 23 
amino acid hydrophobic, proline-rich core, consistent with a signal peptide. The 

10 
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hydrophilic mature protein terminates in a typical LPXTG (SEQ ID NO: 9) domain that 
immediately precedes a hydrophobic 20 amino acid core and a short, basic hydrophilic 
terminus. 

The nucleotide sequence is not homologous to sequences of other known bacterial 
genes. The translated amino acid sequence, however, shares segmental homology with a 
5 number of characterized proteins, including the fimbrial type 2 protein of Actinomyces 
naeslundii (27% identity over 350 amino acids) and the fimbrial type 1 protein of 
Actinomyces viscosus (25% homology over 420 amino acids), the T6 surface protein of S. 
pyogenes (23% identity over 359 amino acids), and the hsf (27% identity over 260 amino 
acids ) and HMW1 adhesins (25% identity over 285 amino acids) of Haemophilus 

10 influenzae. See M.K. Yeung & J.O. Cisar, J Bacteriol 172:242-2468 (1990); O. 

Schneewind, et ah, J Bacteriol 172:3310-3317 (1990); J.W. St. Geme III, et aL, J 
Bacteriol 178:6281-6287 (1996); J.W. St. Geme III, Infect Immun 62: 3881-3889 (1994). 
The function of the S. pyogenes T6 protein is unknown. Each of the other homologues 
plays a role in bacterial adhesion and/or invasion. 

15 The second ORF, spb2, terminates 37 bp upstream from spbl and is in the same 

transcriptional orientation. This 1692 bp ORF has a deduced amino acid sequence of 563 
residues and Mr 64,492. It shares 50.5% nucleic acid identity and 20.7% amino acid 
identity with spbl. Conservation is highest in the carboxy-terminal regions, including a 
shared LPSTGG (SEQ ID NO: 10) motif. In contrast to spbl, spbl does not have an 

20 obvious signal sequence. Its secretion may be mediated by carboxy-terminal recognition 
sequences or by accessory peptides. T. Michiels, et al., Infect Immun 58:2840-2849 
(1990). The deduced amino acid sequence of spb2 is also homologous with S. pyogenes 
T6 and Actinomyces naeslundii proteins, and to Listeria monocytogenes internalin A 
(22% identity over 308 amino acids) — again, proteins important in adhesion and invasion. 

25 Neither of the predicted gene products has the repetitive structure of previously 

described GBS surface proteins such as the C and Rib proteins or of L. monocytogenes 
internalin family members. L.C. Madoff et al., Infect Immun 59:2638-2644 (1991); J. 
Gaillard, et al., Cell 65: 1127-1141 (1991). Hybridization of the originating strain 
874391 genomic DNA with an spbl probe under low stringency conditions identifies a 

30 single band in EcoR I, Sal I and Sst I digests (data not shown), suggesting that a single 
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copy of spbl is present in this strain and that spbl is not a member of a significantly 
homologous "family" of genes. 

Example 2 - Adherence and Invasion Assays Using spbl' Bacteria 
Genomic subtraction identified a 1509 bp ORF, spbl, which is present in virulent 
RDP type III-3 GBS and not in RDP type III-2 strains. The predicted 53 kD protein 
product has the characteristics of a typical gram positive cell-wall bound protein. The 
nucleic acid sequence of spbl is not homologous to sequences of other known bacterial 
genes, however the translated amino acid sequence shares segmental homology with 
several characterized adhesins, including Actinomyces fimbrial proteins and H. influenzae 
HMW1, suggesting that Spbl might contribute to GBS adhesion or invasion. A spbl' 
isogenic deletion mutant GBS strain was created by homologous recombination and the 
ability of the spbl' mutant to adhere to and invade A549 respiratory epithelial cells was 
determined. Compared to the wild type strain, the number of spbl' bacteria adherent to 
A549 monolayers was reduced by 60.0% (p<0.01) and the number of intracellular 
invading bacteria was reduced by 53.6% (p<0.01). Without being bound by any 
particular theory, it appears that Spbl may contribute to the pathogenesis of GBS 
pneumonia and bacterial entry into the bloodstream. 
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CLAIMS: 

1. An isolated nucleic acid molecule comprising nucleotides which code for the 
amino acid sequence of SEQ ID NO: 2. 

2. A recombinant vector comprising the nucleic acid molecule of claim 1 . 

3. The recombinant vector of claim 2, wherein said recombinant vector is a plasmid. 
5 4. The recombinant vector of claim 2, wherein said recombinant vector is a 

prokaryotic or eukaryotic expression vector. 

5. The recombinant vector of claim 2, wherein the nucleic acid molecule is operably 
linked to a heterologous promoter. 

6. A host cell comprising the nucleic acid molecule of claim 1. 

10 7. The host cell of claim 6, wherein the host cell is a eukaryotic host cell. 

8. The host cell of claim 6, wherein the host cell is a prokaryotic host cell. 

9. An isolated nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO: 1. 

10. A recombinant vector comprising the nucleic acid molecule of claim 9. 
15 11. The recombinant vector of claim 10, wherein said recombinant vector is a 

plasmid. 

12. The recombinant vector of claim 10, wherein said recombinant vector is a 
prokaryotic or eukaryotic expression vector. 

13. The recombinant vector of claim 10, wherein the nucleic acid molecule is 
20 operably linked to a heterologous promoter. 

14. A host cell comprising the nucleic acid molecule of claim 9. 

15. The host cell of claim 14, wherein the host cell is a eukaryotic host cell. 

16. The host cell of claim 14, wherein the host cell is a prokaryotic host cell. 

17. An isolated nucleic acid molecule comprising nucleotides which code for the 
25 amino acid sequence of SEQ ID NO: 4 

18. A recombinant vector comprising the nucleic acid molecule of claim 17. 

19. The recombinant vector of claim 18, wherein said recombinant vector is a 
plasmid. 

20. The recombinant vector of claim 1 8, wherein said recombinant vector is a 
30 prokaryotic or eukaryotic expression vector. 
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21 . The recombinant vector of claim 1 8, wherein the nucleic acid molecule is 
operably linked to a heterologous promoter. 

22. A host cell comprising the nucleic acid molecule of claim 17. 

23. The host cell of claim 22, wherein the host cell is a eukaryotic host cell. 

24. The host cell of claim 22, wherein the host cell is a prokaryotic host cell. 

5 25. An isolated nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO: 3. 

26. A recombinant vector comprising the nucleic acid molecule of claim 25. 

27. The recombinant vector of claim 26, wherein said recombinant vector is a 
plasmid. 

10 28. The recombinant vector of claim 26, wherein said recombinant vector is a 
prokaryotic or eukaryotic expression vector. 

29. The recombinant vector of claim 26, wherein the nucleic acid molecule is 
operably linked to a heterologous promoter. 

30. A host cell comprising the nucleic acid molecule of claim 25. 

15 31. The host cell of claim 30, wherein the host cell is a eukaryotic host cell. 

32. The host cell of claim 30, wherein the host cell is a prokaryotic host cell. 

33. A method of immunizing a mammal against Group B streptococci infection, said 
method comprising administering to the mammal a vaccine comprising an 
immunologically effective amount of a recombinantly produced protein comprising the 

20 amino acid sequence of SEQ ID NO: 2. 

34. The method of claim 33, wherein the vaccine further comprises an adjuvant. 

35. The method of claim 34, wherein the adjuvant comprises alum. 

36. The method of claim 33, wherein the vaccine further comprises an 
immunologically effective amount of a recombinantly produced protein comprising the 

25 amino acid sequence of SEQ ID NO: 4. 

37. The method of claim 36, wherein the vaccine further comprises an adjuvant. 

38. A method of immunizing a mammal against Group B streptococci infection, said 
method comprising administering to the mammal a vaccine comprising an 
immunologically effective amount of a recombinantly produced protein comprising the 

30 amino acid sequence of SEQ ID NO: 4. 

39. The method of claim 38, wherein the vaccine further comprises an adjuvant. 
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40. The method of claim 39, wherein the adjuvant comprises alum. 

41. A diagnostic method for determining whether a mammal is infected or colonized 
by virulent Group B streptococci (GBS), said method comprising the steps of: 

(a) collecting a bodily fluid or culture from the mammal; 

(b) analyzing the bodily fluid or culture for the presence or absence of one or 
5 more gene products specific to type III-3 GBS; 

wherein the presence of one or more gene products specific to type III-3 GBS indicates 
infection or colonization by virulent GBS. 

42. The diagnostic method of claim 41, wherein the one or more gene products 
specific to type III-3 GBS comprise a protein, said protein comprising the amino acid 

1 0 sequence of SEQ ID NO: 2. 

43. The diagnostic method of claim 41, wherein the one or more gene products 
specific to type III-3 GBS comprise a protein, said protein comprising the amino acid 
sequence of SEQ ID NO: 4. 

44. The diagnostic method of claim 4 1 , wherein the one or more gene products 

15 specific to type III-3 GBS comprise a first protein comprising the amino acid sequence of 
SEQ ID NO: 2 and a second protein comprising the amino acid sequence of SEQ ED 
NO: 4. 

45. The diagnostic method of claim 41, wherein the mammal is a human. 

46. The diagnostic method of claim 41, wherein the bodily fluid or culture is a vaginal 
20 or rectovaginal culture. 

47. The diagnostic method of claim 41, wherein the bodily fluid or culture is a throat 
culture. 

48. The diagnostic method of claim 41 , wherein the bodily fluid or culture is blood, 
serum, amniotic fluid, cerebrospinal fluid, or joint fluid. 

25 49. The diagnostic method of claim 41 , wherein the analysis step comprises using 

polymerase chain reaction (PCR) to identify the presence or absence of one or more gene 
products specific to type III-3 GBS. 

50. The diagnostic method of claim 49, wherein the analysis step comprises using 
PCR to determine the presence or absence of the spbl gene product. 
30 51. The diagnostic method of claim 49, wherein the analysis step comprises using 
PCR to determine the presence or absence of the spb2 gene product. 
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52. The diagnostic method of claim 41 , wherein the analysis step comprises using 
antibodies to identify the presence or absence of one or more gene products specific to 
type III-3 GBS. 

53. The diagnostic method of claim 52, wherein the antibodies are monoclonal 
antibodies. 

5 54. The diagnostic method of claim 53, wherein the monoclonal antibodies are 
specific for the spbl gene product. 

55. The diagnostic method of claim 53, wherein the monoclonal antibodies are 
specific for the spb2 gene product. 

56. An isolated and purified protein comprising the amino acid sequence of SEQ ID 
10 NO: 2. 

57. A vaccine for immunizing a mammalian host against virulent Group B 
streptococci infection, said vaccine comprising the protein of claim 56. 

58. The vaccine of claim 57, further comprising an adjuvant. 

59. The vaccine of claim 58, wherein the adjuvant comprises alum. 

15 60. The vaccine of claim 57, wherein the protein is conjugated to a bacterial 
polysaccharide or oligosaccharide. 

61 . An isolated and purified protein comprising the amino acid sequence of SEQ ID 
NO: 4. 

62. A vaccine for immunizing a mammalian host against virulent Group B 
20 streptococci infection, said vaccine comprising the protein of claim 61 . 

63. The vaccine of claim 62, further comprising an adjuvant. 

64. The vaccine of claim 63, wherein the adjuvant comprises alum. 

65. The vaccine of claim 62, further comprising an isolated and purified protein 
comprising the amino acid sequence of SEQ ID NO: 2. 

25 66. The vaccine of claim 65, further comprising an adjuvant. 

67. The vaccine of claim 62, wherein the protein is conjugated to a bacterial 
polysaccharide or oligosaccharide. 



16 



WO 00/78787 



PCTAJS00/17082 



SIMILARITY ISOLATE NO. SIMILARITY 




Figure 1. Type m GBS RDP types. Comparison of 
HindHL and Sse83871 RDP typing of 62 type ffl GBS 
isolates from Salt Lake City and Tokyo. Isolates were 
classified into types based on the similarity of the 
restriction digest patterns produced by HindHL or 
Sse83871 digestion of chromosomal DNA. The two 
methods divided the isolates into RDP types containing 
exactly the same isolates: EH-3 contains isolates 1-41, 
IH-2 contains isolates 42 - 59, and m-1 contains isolates 
60-62. 
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Figure 2. RDP type m_ 3 specific probes. Dot blot hybridization of 
probe 1 with genomic DNA isolated from type III GBS 10 ug of 
genomic DNA from each of 62 type III GBS strains was transferred to 
nylon membrane. Radiolabeled probe 1 hybridized with DNA from all 
III-3 Istrains (rows A-D) including the original type III-3 strain (well E- 

i.nV^ ^ failed t0 h y bridizc ^ °NA from III-2 strains (Fl- 
FIO^Gl-7) including the original strain used in the subtraction 
hybndizalion (well E 10) and III-l strains (wells HI -3). The same 
pattern of hybridization was observed using clone 3 and 11 probes 
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<110> Adderson, Elisabeth E. 
Bohnsack, John F. 

University of Utah Research Foundation 

<12 0> Isolated Genes from Virulent Group B Streptococcus 
agalactiae 

<130> 1321.2.29 

<140> 
<141> 

<150> 60/140, 084 

<151> 1999-06-21 

<160> 10 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1509 
<212> DNA 

<213> Streptococcus agalactiae 

<220> 

<221> CDS 

<222> (1) . . (1509) 

<400> 1 

atg aaa aag aaa atg att caa teg ctg tta gtg gcg agt tta gca ttt 48 
Met Lys Lys Lys Met lie Gin Ser Leu Leu Val Ala Ser Leu Ala Phe 
15 10 15 

ggt atg get gta tea cca gtt acg ccg ata get ttt gee get gag aca 96 
Gly Met Ala Val Ser Pro Val Thr Pro lie Ala Phe Ala Ala Glu Thr 

20 25 30 

ggg aca att aca gtt caa gat act caa aaa ggc gca ace tat aaa gca 144 
Gly Thr lie Thr Val Gin Asp Thr Gin Lys Gly Ala Thr Tyr Lys Ala 

35 40 45 

tat aaa gtt ttt gat gca gaa ata gat aat gca aat gta tct gat teg 192 
Tyr Lys Val Phe Asp Ala Glu lie Asp Asn Ala Asn Val Ser Asp Ser 
50 55 60 

aat aaa gat gga get tct tat tta att cct caa ggt aaa gaa get gag 240 
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Asn Lys Asp Gly Ala Ser Tyr Leu lie Pro Gin Gly Lys Glu Ala Glu 
65 70 75 80 



tat aaa get tea act gat ttt aat tct ctt ttt acg aca act act aat 288 
Tyr Lys Ala Ser Thr Asp Phe Asn Ser Leu Phe Thr Thr Thr Thr Asn 

85 90 95 



99 a 999 a 9 a aca tat gta act aaa aaa gat act gcg tea gca aat gag 336 
Gly Gly Arg Thr Tyr Val Thr Lys Lys Asp Thr Ala Ser Ala Asn Glu 

100 105 110 



att gcg aca tgg get aaa tct ata tea get aat act aca cca gtt tec 384 
lie Ala Thr Trp Ala Lys Ser lie Ser Ala Asn Thr Thr Pro Val Ser 
115 120 125 



act gtt act gag tea aat aat gat ggt act gag gtt att aat gtt tec 432 
Thr Val Thr Glu Ser Asn Asn Asp Gly Thr Glu Val lie Asn Val Ser 
130 135 140 



caa tat gga tat tat tat gtt tct age act gtt aat aat gga get gta 480 
Gin Tyr Gly Tyr Tyr Tyr Val Ser Ser Thr Val Asn Asn Gly Ala Val 
145 150 155 160 



att atg gtt aca tct gta act cca aat get act att cat gaa aag aat 528 
lie Met Val Thr Ser Val Thr Pro Asn Ala Thr lie His Glu Lys Asn 

165 170 175 



act gat gcg aca tgg gga gat ggt ggt gga aaa act gta gat caa aaa 576 
Thr Asp Ala Thr Trp Gly Asp Gly Gly Gly Lys Thr Val Asp Gin Lys 

180 185 190 



acg tac teg gtt ggt gat aca gtc aaa tat act att act tat aag aat 624 
Thr Tyr Ser Val Gly Asp Thr Val Lys Tyr Thr lie Thr Tyr Lys Asn 
195 200 205 



gca gtc aat tat cat ggt aca gaa aaa gtg tat caa tat gtt ata aag 672 
Ala Val Asn Tyr His Gly Thr Glu Lys Val Tyr Gin Tyr Val lie Lys 
210 215 220 



gat act atg cca tct get tct gta gtt gat ttg aac gaa ggg tct tat 720 
Asp Thr Met Pro Ser Ala Ser Val Val Asp Leu Asn Glu Gly Ser Tyr 
225 230 235 240 



gaa gta act att act gat gga tea ggg aat att aca act eta act caa 768 
Glu Val Thr lie Thr Asp Gly Ser Gly Asn lie Thr Thr Leu Thr Gin 

245 250 255 



ggt teg gaa aaa gca act ggg aag tat aac ctg tta gag gaa aat aat 816 
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Gly Ser Glu Lys Ala Thr Gly Lys Tyr Asn Leu Leu Glu Glu Asn Asn 

260 265 270 



aat ttc acg att act att ccg tgg gca get acc aat act cca acc gga 864 
Asn Phe Thr lie Thr lie Pro Trp Ala Ala Thr Asn Thr Pro Thr Gly 
275 280 285 



aat act caa aat gga get aat gat gac ttt ttt tat aag gga ata aat 912 
Asn Thr Gin Asn Gly Ala Asn Asp Asp Phe Phe Tyr Lys Gly lie Asn 
290 295 300 



aca ate aca gtc act tat aca gga gta tta aag agt gga get aaa cca 960 
Thr lie Thr Val Thr Tyr Thr Gly Val Leu Lys Ser Gly Ala Lys Pro 
305 310 315 320 



ggt tea get gat tta cca gaa aat aca aac att gcg acc ate aac ccc 1008 
Gly Ser Ala Asp Leu Pro Glu Asn Thr Asn lie Ala Thr lie Asn Pro 

325 330 335 



aat act age aat gat gac cca ggt caa aaa gta aca gtg agg gat ggt 1056 
Asn Thr Ser Asn Asp Asp Pro Gly Gin Lys Val Thr Val Arg Asp Gly 

340 345 350 



caa att act ata aaa aaa att gat ggt tec aca aaa get tea tta caa 1104 
Gin lie Thr lie Lys Lys lie Asp Gly Ser Thr Lys Ala Ser Leu Gin 
355 360 365 



99t get ata ttt gtt tta aag aat get acg ggt caa ttt eta aac ttt 1152 
Gly Ala lie Phe Val Leu Lys Asn Ala Thr Gly Gin Phe Leu Asn Phe 
370 375 380 



aac gat aca aat aac gtt gaa tgg 
Asn Asp Thr Asn Asn Val Glu Trp 
385 390 

tat aca aca gga gca gat ggt ata 
Tyr Thr Thr Gly Ala Asp Gly lie 

405 



ggc aca gaa get aat gca aca gaa 1200 
Gly Thr Glu Ala Asn Ala Thr Glu 
395 400 

att acc att aca ggc ttg aaa gaa 1248 
lie Thr lie Thr Gly Leu Lys Glu 
410 415 



ggt aca tac tat eta gtt gag aaa aag get ccc tta ggt tac aat ttg 1296 
Gly Thr Tyr Tyr Leu Val Glu Lys Lys Ala Pro Leu Gly Tyr Asn Leu 

420 425 430 



tta gat aac tct cag aag gtt att tta gga gat gga gee act gat acg 1344 
Leu Asp Asn Ser Gin Lys Val lie Leu Gly Asp Gly Ala Thr Asp Thr 
435 440 445 



act aat tea gat aac ctt tta gtt aac cca act gtt gaa aat aac aaa 1392 
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Thr Asn Ser Asp Asn Leu Leu Val Asn Pro Thr Val Glu Asn Asn Lys 
450 455 460 

ggt act gag ttg cct tea aca ggt ggt att ggt aca aca att ttc tac 1440 

Gly Thr Glu Leu Pro Ser Thr Gly Gly lie Gly Thr Thr lie Phe Tyr 

465 470 475 480 

att ata ggt gca att tta gta ata gga gca ggt ate gtg ctt gtt get 1488 

lie He Gly Ala He Leu Val He Gly Ala Gly He Val Leu Val Ala 

485 490 495 

cgt cgt cgt tta cgt tct taa 1509 

Arg Arg Arg Leu Arg Ser 

500 



<210> 2 
<211> 502 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 2 

Met Lys Lys Lys Met He Gin Ser Leu Leu Val Ala Ser Leu Ala Phe 

15 10 15 

Gly Met Ala Val Ser Pro Val Thr Pro He Ala Phe Ala Ala Glu Thr 

20 25 30 

Gly Thr He Thr Val Gin Asp Thr Gin Lys Gly Ala Thr Tyr Lys Ala 

35 40 45 

Tyr Lys Val Phe Asp Ala Glu He Asp Asn Ala Asn Val Ser Asp Ser 

50 55 60 

Asn Lys Asp Gly Ala Ser Tyr Leu He Pro Gin Gly Lys Glu Ala Glu 
65 70 75 80 

Tyr Lys Ala Ser Thr Asp Phe Asn Ser Leu Phe Thr Thr Thr Thr Asn 

85 90 95 

Gly Gly Arg Thr Tyr Val Thr Lys Lys Asp Thr Ala Ser Ala Asn Glu 

100 105 110 

He Ala Thr Trp Ala Lys Ser He Ser Ala Asn Thr Thr Pro Val Ser 

115 120 125 

Thr Val Thr Glu Ser Asn Asn Asp Gly Thr Glu Val He Asn Val Ser 

130 135 140 

Gin Tyr Gly Tyr Tyr Tyr Val Ser Ser Thr Val Asn Asn Gly Ala Val 
145 150 155 160 

He Met Val Thr Ser Val Thr Pro Asn Ala Thr He His Glu Lys Asn 

165 170 175 

Thr Asp Ala Thr Trp Gly Asp Gly Gly Gly Lys Thr Val Asp Gin Lys 

180 185 190 

Thr Tyr Ser Val Gly Asp Thr Val Lys Tyr Thr He Thr Tyr Lys Asn 
195 200 205 
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Ala Val Asn Tyr His Gly Thr Glu Lys Val Tyr Gin Tyr Val lie Lys 

210 215 220 

Asp Thr Met Pro Ser Ala Ser Val Val Asp Leu Asn Glu Gly Ser Tyr 
225 230 235 240 

Glu Val Thr lie Thr Asp Gly Ser Gly Asn lie Thr Thr Leu Thr Gin 

245 250 255 

Gly Ser Glu Lys Ala Thr Gly Lys Tyr Asn Leu Leu Glu Glu Asn Asn 

260 265 270 

Asn Phe Thr lie Thr lie Pro Trp Ala Ala Thr Asn Thr Pro Thr Gly 

275 280 285 

Asn Thr Gin Asn Gly Ala Asn Asp Asp Phe Phe Tyr Lys Gly lie Asn 

290 295 300 

Thr lie Thr Val Thr Tyr Thr Gly Val Leu Lys Ser Gly Ala Lys Pro 
305 310 315 320 

Gly Ser Ala Asp Leu Pro Glu Asn Thr Asn lie Ala Thr lie Asn Pro 

325 330 335 

Asn Thr Ser Asn Asp Asp Pro Gly Gin Lys Val Thr Val Arg Asp Gly 

340 345 350 

Gin lie Thr lie Lys Lys lie Asp Gly Ser Thr Lys Ala Ser Leu Gin 

355 360 365 

Gly Ala lie Phe Val Leu Lys Asn Ala Thr Gly Gin Phe Leu Asn Phe 

370 375 380 

Asn Asp Thr Asn Asn Val Glu Trp Gly Thr Glu Ala Asn Ala Thr Glu 
385 390 395 400 

Tyr Thr Thr Gly Ala Asp Gly lie lie Thr lie Thr Gly Leu Lys Glu 

405 410 415 

Gly Thr Tyr Tyr Leu Val Glu Lys Lys Ala Pro Leu Gly Tyr Asn Leu 

420 425 430 

Leu Asp Asn Ser Gin Lys Val lie Leu Gly Asp Gly Ala Thr Asp Thr 

435 440 445 

Thr Asn Ser Asp Asn Leu Leu Val Asn Pro Thr Val Glu Asn Asn Lys 

450 455 460 

Gly Thr Glu Leu Pro Ser Thr Gly Gly lie Gly Thr Thr lie Phe Tyr 
465 470 475 480 

lie lie Gly Ala lie Leu Val lie Gly Ala Gly lie Val Leu Val Ala 

485 490 495 

Arg Arg Arg Leu Arg Ser 

500 



<210> 3 
<211> 1692 
<212> DNA 

<213> Streptococcus agalactiae 

<220> 
<221> CDS 
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<400> 3 

att tgc att atg gtg ate gta ttc egg att ata cag ata tta caa ggg 48 

He Cys He Met Val He Val Phe Arg He He Gin He Leu Gin Gly 
1 5 10 15 

att ata tec aag ate ctt cag gta cat att att ata agt atg att cac 96 
He He Ser Lys He Leu Gin Val His He He He Ser Met He His 

20 25 30 

gag ata aag ate ccg act caa eta aag atg cct att ata cga cag ata 144 
Glu He Lys He Pro Thr Gin Leu Lys Met Pro He He Arg Gin He 

35 40 45 

eta gtc tea tea aat gtt gat aca aca act aag tac aag tac gta aaa 192 
Leu Val Ser Ser Asn Val Asp Thr Thr Thr Lys Tyr Lys Tyr Val Lys 
50 55 60 

gac get tac aaa tta gtc ggt tgg tat tat gtt aat cca tat ggt agt 240 
Asp Ala Tyr Lys Leu Val Gly Trp Tyr Tyr Val Asn Pro Tyr Gly Ser 
65 70 75 80 

att aga cct tat aac ttt tea ggt get gta act caa gat ate aat tta 288 
He Arg Pro Tyr Asn Phe Ser Gly Ala Val Thr Gin Asp He Asn Leu 

85 90 95 

aga get att tgg cga aag get gga gat tat cat att ata tac age aat 336 
Arg Ala He Trp Arg Lys Ala Gly Asp Tyr His He He Tyr Ser Asn 

100 105 110 

gat get gtt ggt aca gat gga aag cca gca ttg gat get tct ggt cag 3 84 
Asp Ala Val Gly Thr Asp Gly Lys Pro Ala Leu Asp Ala Ser Gly Gin 
115 120 125 

caa tta caa aca agt aat gag cct act gac cct gat tec tat gac gat 432 
Gin Leu Gin Thr Ser Asn Glu Pro Thr Asp Pro Asp Ser Tyr Asp Asp 
130 135 140 

ggc tec cat tea gee tta ctg aga cgt ccg aca atg cca gat ggc tat 480 
Gly Ser His Ser Ala Leu Leu Arg Arg Pro Thr Met Pro Asp Gly Tyr 
14 5 150 155 160 

cgt ttc cgt ggc tgg tgg tac aat ggt aaa att tat aac cca tat gat 52 8 
Arg Phe Arg Gly Trp Trp Tyr Asn Gly Lys He Tyr Asn Pro Tyr Asp 

165 170 175 

tec att gat att gac gee cat tta gca gat get aat aaa aat ate ace 576 
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Ser lie Asp lie Asp Ala His Leu Ala Asp Ala Asn Lys Asn lie Thr 

180 185 190 



ata aaa cct gtc att att cca gta gga gat ate aaa tta gaa gat acc 624 

lie Lys Pro Val lie lie Pro Val Gly Asp lie Lys Leu Glu Asp Thr 
195 200 205 

tec ate aaa tac aat ggt aac ggt ggt act aga gta gaa aat ggt aat 672 

Ser lie Lys Tyr Asn Gly Asn Gly Gly Thr Arg Val Glu Asn Gly Asn 

210 215 220 



gtg gta aca caa gtg gag aca ccg cgt atg gag ttg aat age aca act 720 
Val Val Thr Gin Val Glu Thr Pro Arg Met Glu Leu Asn Ser Thr Thr 
225 230 235 240 



aca att cct gaa aac caa tac ttt aca agg aca ggt tac aac ctt att 
Thr lie Pro Glu Asn Gin Tyr Phe Thr Arg Thr Gly Tyr Asn Leu lie 

245 250 255 



768 



ggt tgg cat cat gat aag gat tta get gat aca gga cgt gtg gaa ttt 816 
Gly Trp His His Asp Lys Asp Leu Ala Asp Thr Gly Arg Val Glu Phe 

260 265 270 



aca gca ggt caa tea ata ggt att gat aac aac ctt gat gca aca aat 864 
Thr Ala Gly Gin Ser lie Gly lie Asp Asn Asn Leu Asp Ala Thr Asn 
275 280 285 



acc tta tat get gtt tgg caa cct aaa gaa tac acc gtc gga gta agt 912 
Thr Leu Tyr Ala Val Trp Gin Pro Lys Glu Tyr Thr Val Gly Val Ser 
290 295 300 



aaa act gtc gtt gga eta gat gaa gat aag acg aaa gac ttc ttg ttt 960 
Lys Thr Val Val Gly Leu Asp Glu Asp Lys Thr Lys Asp Phe Leu Phe 
305 310 315 320 



aat cca agt gaa acg ttg caa caa gag aat ttt ccg ctg aga gat ggt 10 0 8 
Asn Pro Ser Glu Thr Leu Gin Gin Glu Asn Phe Pro Leu Arg Asp Gly 

325 330 335 



cag act aag gaa ttt aaa gta cct tat gga act tct ata tea ata gat 1056 
Gin Thr Lys Glu Phe Lys Val Pro Tyr Gly Thr Ser lie Ser lie Asp 

340 345 350 



gaa caa gee tac gat gaa ttt aaa gta tct gag tea att aca gaa aaa 1104 
Glu Gin Ala Tyr Asp Glu Phe Lys Val Ser Glu Ser lie Thr Glu Lys 
355 360 365 



aat eta gca act ggt gaa get gat aaa act tat gat get acc ggc tta 1152 
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Asn Leu Ala Thr Gly Glu Ala Asp Lys Thr Tyr Asp Ala Thr Gly Leu 
370 375 380 

caa tec ctg aca gtt tea gga gac gta gat att age ttt ace aat aca 1200 

Gin Ser Leu Thr Val Ser Gly Asp Val Asp lie Ser Phe Thr Asn Thr 

385 390 395 400 

cgt ate aag caa aaa gta cga eta cag aaa gtt aat gtc gaa aat gat 124 8 

Arg lie Lys Gin Lys Val Arg Leu Gin Lys Val Asn Val Glu Asn Asp 

405 410 415 

aat aat ttt tta gca ggt gca gtt ttt gat att tat gaa tea gat get 1296 

Asn Asn Phe Leu Ala Gly Ala Val Phe Asp lie Tyr Glu Ser Asp Ala 

420 425 430 

aat ggg aat aaa get tea cat cct atg tat tea ggg ctg gtg aca aac 1344 

Asn Gly Asn Lys Ala Ser His Pro Met Tyr Ser Gly Leu Val Thr Asn 
435 440 445 

gat aaa ggc ttg tta tta gtg gat get aat aac tac etc agt ttg cca 1392 

Asp Lys Gly Leu Leu Leu Val Asp Ala Asn Asn Tyr Leu Ser Leu Pro 
450 455 460 

gta gga aaa tac tac eta aca gag aca aag gee cct cca ggg tac eta 1440 

Val Gly Lys Tyr Tyr Leu Thr Glu Thr Lys Ala Pro Pro Gly Tyr Leu 

465 470 475 480 

eta cct aaa aat gat gat ata tea gta tta gtg att tct acg gga gtt 148 8 

Leu Pro Lys Asn Asp Asp lie Ser Val Leu Val lie Ser Thr Gly Val 

485 490 495 

acc ttt gaa caa aat ggt aat aat gcg aca cca ata aaa gag aat tta 153 6 

Thr Phe Glu Gin Asn Gly Asn Asn Ala Thr Pro lie Lys Glu Asn Leu 

500 505 510 

gtg gat gga agt aca gta tat act ttt aaa att act aac agt aaa gga 1584 

Val Asp Gly Ser Thr Val Tyr Thr Phe Lys lie Thr Asn Ser Lys Gly 
515 520 525 

aca gaa ttg cct agt act gga ggt att gga aca cac att tat ate eta 1632 
Thr Glu Leu Pro Ser Thr Gly Gly lie Gly Thr His lie Tyr lie Leu 
530 535 540 

gtt ggt tta get tta get eta cca tea gga tta ata tta tac tat cga 1680 
Val Gly Leu Ala Leu Ala Leu Pro Ser Gly Leu lie Leu Tyr Tyr Arg 

545 550 555 560 



aaa aaa ata tga 



1692 
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Lys Lys lie 
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<210> 4 
<211> 563 
<212> PRT 

<213> Streptococcus agalactiae 



<400> 4 

lie Cys lie Met 
1 

lie lie Ser Lys 

20 

Glu lie Lys lie 

35 

Leu Val Ser Ser 
50 

Asp Ala Tyr Lys 
65 

lie Arg Pro Tyr 

Arg Ala lie Trp 

100 

Asp Ala Val Gly 
115 

Gin Leu Gin Thr 
130 

Gly Ser His Ser 
145 

Arg Phe Arg Gly 

Ser lie Asp lie 

180 

lie Lys Pro Val 
195 

Ser lie Lys Tyr 
210 

Val Val Thr Gin 
225 

Thr lie Pro Glu 

Gly Trp His His 

260 

Thr Ala Gly Gin 
275 

Thr Leu Tyr Ala 
290 

Lys Thr Val Val 



Val lie Val Phe 
5 

lie Leu Gin Val 

Pro Thr Gin Leu 

40 

Asn Val Asp Thr 

55 

Leu Val Gly Trp 
70 

Asn Phe Ser Gly 
85 

Arg Lys Ala Gly 

Thr Asp Gly Lys 

120 

Ser Asn Glu Pro 
135 

Ala Leu Leu Arg 
150 

Trp Trp Tyr Asn 
165 

Asp Ala His Leu 

He He Pro Val 

200 

Asn Gly Asn Gly 
215 

Val Glu Thr Pro 
230 

Asn Gin Tyr Phe 
245 

Asp Lys Asp Leu 

Ser He Gly He 

280 

Val Trp Gin Pro 
295 

Gly Leu Asp Glu 



Arg He He Gin 
10 

His He He He 
25 

Lys Met Pro He 

Thr Thr Lys Tyr 

60 

Tyr Tyr Val Asn 

75 

Ala Val Thr Gin 
90 

Asp Tyr His He 
105 

Pro Ala Leu Asp 

Thr Asp Pro Asp 

140 

Arg Pro Thr Met 
155 

Gly Lys He Tyr 
170 

Ala Asp Ala Asn 
185 

Gly Asp He Lys 

Gly Thr Arg Val 

220 

Arg Met Glu Leu 
235 

Thr Arg Thr Gly 
250 

Ala Asp Thr Gly 
265 

Asp Asn Asn Leu 

Lys Glu Tyr Thr 

300 

Asp Lys Thr Lys 



He Leu Gin Gly 

15 

Ser Met He His 
30 

He Arg Gin He 
45 

Lys Tyr Val Lys 

Pro Tyr Gly Ser 

80 

Asp He Asn Leu 

95 

He Tyr Ser Asn 
110 

Ala Ser Gly Gin 
125 

Ser Tyr Asp Asp 

Pro Asp Gly Tyr 

160 

Asn Pro Tyr Asp 
175 

Lys Asn He Thr 
190 

Leu Glu Asp Thr 
205 

Glu Asn Gly Asn 

Asn Ser Thr Thr 

240 

Tyr Asn Leu He 
255 

Arg Val Glu Phe 
270 

Asp Ala Thr Asn 
285 

Val Gly Val Ser 
Asp Phe Leu Phe 
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305 310 315 320 

Asn Pro Ser Glu Thr Leu Gin Gin Glu Asn Phe Pro Leu Arg Asp Gly 

325 330 335 

Gin Thr Lys Glu Phe Lys Val Pro Tyr Gly Thr Ser lie Ser lie Asp 

340 345 350 

Glu Gin Ala Tyr Asp Glu Phe Lys Val Ser Glu Ser lie Thr Glu Lys 

355 360 365 

Asn Leu Ala Thr Gly Glu Ala Asp Lys Thr Tyr Asp Ala Thr Gly Leu 

370 375 380 

Gin Ser Leu Thr Val Ser Gly Asp Val Asp lie Ser Phe Thr Asn Thr 
385 390 395 400 

Arg lie Lys Gin Lys Val Arg Leu Gin Lys Val Asn Val Glu Asn Asp 

405 410 415 

Asn Asn Phe Leu Ala Gly Ala Val Phe Asp lie Tyr Glu Ser Asp Ala 

420 425 430 

Asn Gly Asn Lys Ala Ser His Pro Met Tyr Ser Gly Leu Val Thr Asn 

435 440 445 

Asp Lys Gly Leu Leu Leu Val Asp Ala Asn Asn Tyr Leu Ser Leu Pro 

450 455 460 

Val Gly Lys Tyr Tyr Leu Thr Glu Thr Lys Ala Pro Pro Gly Tyr Leu 
465 470 475 480 

Leu Pro Lys Asn Asp Asp lie Ser Val Leu Val lie Ser Thr Gly Val 

485 490 495 

Thr Phe Glu Gin Asn Gly Asn Asn Ala Thr Pro lie Lys Glu Asn Leu 

500 505 510 

Val Asp Gly Ser Thr Val Tyr Thr Phe Lys lie Thr Asn Ser Lys Gly 

515 520 525 

Thr Glu Leu Pro Ser Thr Gly Gly lie Gly Thr His lie Tyr lie Leu 

530 535 540 

Val Gly Leu Ala Leu Ala Leu Pro Ser Gly Leu lie Leu Tyr Tyr Arg 
545 550 555 560 

Lys Lys lie 



<210> 5 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
oligonucleotide 

<400> 5 

ctaggtggat ccttcggcaa t 21 
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<210> 6 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
oligonucleotide 

<400> 6 

cgattgccga 10 



<210> 7 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
oligonucleotide 

<400> 7 

aggcaactgt gctaaccgag ggaat 2 5 



<210> 8 

<211> 11 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
oligonucleotide 

<400> 8 

cgattccctc g 11 



<210> 9 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Consensus 
sequence 
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<400> 9 

Leu Pro Xaa Thr Gly 
1 5 



<210> 10 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Consensus 
sequence 

<400> 10 

Leu Pro Ser Thr Gly Gly 
1 5 
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(54) Title: ISOLATED GENES FROM VIRULENT GROUP B STREPTOCOCCUS AG ALACTIAE 

(57) Abstract: The present invention relates to the identification and prevention of infections by virulent forms of Group B strepto- 
cocci. Disclosed herein is the identification of two genes, spbl <<mdspb2, that are specific to virulent type III-3 GBS. Also disclosed 
herein are diagnostic methods for detecting virulent GBS infections and methods of immunizing a mammal against these bacteria. 
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ISOLATED GENES FROM VIRULENT GROUP B STREPTOCOCCUS 

AGALACTIAE 



1. F IE LD OF TH E INVENTION 

The present invention relates to genes isolated from Group B streptococci 
("GBS")- More specifically, the present invention relates to genes that are specific to 
virulent forms of GBS and methods of using such genes and their products for the 
diagnosis and treatment of GBS infections. 



2. T ECHNIC AL BACKGROUND 

10 Group B streptococci ("GBS") are a common cause of disease in newborns, 

pregnant women, and other persons. Common manifestations of these infections include 
bacteremia, pneumonia, meningitis, endocarditis, and osteoarticular infections. CJ. 
Baker & M.S. Edwards, Group B Streptococcal Infections, in Infectious Disease of the 
Fetus and Newborn Infant, 980-1054 (J.S. Remington & TO. Klein, eds., 1995); P. 

15 Munoz et al., Arch Int Med 157:213-216 (1997). 

Nearly 3 out of every 1 ,000 children born are infected with an invasive form of 
GBS disease. While GBS disease is of great concern in neonates, GBS is also an 
important pathogen in the general population, in which the incidence of invasive GBS 
disease is nearly 8 in 100,000. Of these infections, the mortality rate can be as high as 

20 30%. 

During childbirth, GBS can pass from the mother to the newborn. By one 
estimate, up to 30% of pregnant women carry GBS at least temporarily in the vagina or 
rectum without symptoms. Infants born to these women become colonized with GBS 
during delivery. Baker & Edwards, supra. Aspiration of infected amniotic fluid or 

25 vaginal secretions allow GBS to gain access to the lungs. Adhesion to, and invasion of, 
respiratory epithelium and endothelium appear to be critical factors in early onset 
neonatal infection. Baker & Edwards, supra; C.E. Rubens et al., J InfDis 164:320-330 
(1991). Subsequent steps in infection, such as blood stream invasion and the 
establishment of metastatic local infections have not been clarified. The pathogenesis of 

30 neonatal infection occurring after the first week of life is also not well understood. 

Gastrointestinal colonization may be more important than a respiratory focus in late onset 
neonatal disease. Baker & Edwards, supra. Considerable evidence suggests that invasion 
of brain microvascular endothelial cells by GBS is the initial step in the pathogenesis of 
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meningitis. GBS are able to invade human brain microvascular endothelial cells and type 
III GBS, which are responsible for the majority of meningitis, accomplish this 2-6 times 
more efficiently than other serotypes. V. Nizet et al. Infect Immun 65:5074-5081 (1997). 

Because GBS is widely distributed among the population and is an important 
pathogen in newborns, pregnant women are commonly tested for GBS at 26 to 28 weeks 
5 of pregnancy. Much of GBS neonatal disease is preventable by administration of 

prophylactic antibiotics during labor to women who test positive or display known risk 
factors. However, these antibiotics programs do not prevent all GBS disease. The 
programs are deficient for a number of reasons. First, the programs can be inefficient. 
Second, it is difficult to ensure that all healthcare providers and patients comply with the 

10 testing and treatment. And finally, if new serotypes or antibiotic resistance emerges, the 
antibiotic programs may fail altogether. Currently available tests for GBS are inefficient. 
These tests may provide false negatives. Furthermore, the tests are not specific to virulent 
strains of GBS. Thus, antibiotic treatment may be given unnecessarily and add to the 
problem of antibiotic resistance. Although a vaccine would be advantageous, none are 

1 5 yet commercially available. 

Traditionally, GBS have been divided into 9 serotypes according to the 
immunologic reactivity of the polysaccharide capsule. H.M. Blumberg et al., J InfDis 
173: 365-373 (1996). Serotype III GBS cause 60-70% of all infections and almost all 
meningitis. Baker & Edwards, supra. Type III GBS can be subdivided into three groups 

20 of related strains based on the analysis of restriction digest patterns (RDPs) produced by 
digestion of chromosomal DNA with Hind III and &e8387 I. Y. Nagano et al., J Med 
Micro 35:297-303 (1991); S. Takahashi et al., J InfDis 177:1116-1119 (1998). Figure 1 
illustrates a comparison of Hind III and SseS3S7 I RDP typing of 62 type III isolates from 
Salt Lake City, Utah and Tokyo, Japan. Isolates were classified into types based on the 

25 similarity of the restriction digest patterns produced by Hind III or SseS3S7 I digestion of 
chromosomal DNA. The two methods divided the isolates into RDP types containing 
exactly the same isolates: III-3 contains isolates 1-41, II-2 contains isolates 42-59, and II- 

1 contains isolates 60-62. 

Over 90% of invasive type III GBS neonatal disease in Tokyo, Japan and in Salt 
30 Lake City, Utah is caused by bacteria from one of three RDP types, termed RDP type III- 
3, while RDP type III-2 are significantly more likely to be isolated from vagina than from 



2 



WO 00/78787 PCT/US00/17082 

blood or CSF. These results suggest that this genetically-related cluster of type III-3 GBS 
are more virulent than III-2 strains and could be responsible for the majority of invasive 
type III disease globally. 

From the foregoing, it will be appreciated that it would be a significant 
advancement in the art to provide one or more markers that are specific to virulent type 
III-3 GBS. It would be a further advancement to provide a method to exploit these 
markers for clinical identification of virulent type III-3 GBS. It would be a further 
advancement to provide methods for producing vaccines against type III-3 GBS. 

Such compositions and methods are disclosed herein. 



10 3. BEJEE SUMMARY OF THE INVENTION 

The present invention relates to markers specific to type III-3 GBS. These 
markers, the spbl and spbl gene products (SEQ ID NO: 2 and SEQ ID NO: 4, 
respectively), are encoded by the spbl (SEQ ID NO: 1) and spb2 (SEQ ID NO: 3) genes. 
The invention also provides these genes and gene products in substantially purified form. 

15 In certain other embodiments, the present invention relates to recombinant vectors 

which incorporate the spbl gene or other nucleic acid molecules that code for the spbl 
gene product. The recombinant vector may be a plasmid. In certain embodiments, the 
recombinant vector is a prokaryotic or eukaryotic expression vector. In certain preferred 
embodiments, the nucleic acid molecule is operably linked to a heterologous promoter 

20 and/or other expression control elements, such as heterologous enhancers and 
polyadenylation sequences. 

In certain other embodiments, the present invention relates to recombinant vectors 
which incorporate the spbl gene or other nucleic acid molecules that code for the spb2 
gene product. The recombinant vector may be a plasmid. In certain embodiments, the 

25 recombinant vector is a prokaryotic or eukaryotic expression vector. In certain preferred 
embodiments, the nucleic acid molecule is operably linked to a heterologous promoter 
and/or other expression control elements. 

The present invention also provides host cells comprising the spbl and/or spb2 
genes. In other embodiments, a host cell of the present invention comprises nucleic acid 

30 molecules that code for the spbl and/or spb2 gene products. The host cell may be a 
prokaryotic or eukaryotic host cell. 
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The present invention also relates to diagnostic methods for determining whether 
a mammal is infected or colonized by virulent GBS. In certain embodiments, a diagnostic 
method comprises the steps of (1) collecting a bodily fluid or culture from the mammal 
and (2) analyzing the bodily fluid or culture for the presence of absence of one or more 
gene products specific to type III-3 GBS, wherein the presence of one or more gene 
5 products specific to type III-3 GBS indicates infection or colonization by virulent GBS. 
The mammal may be a human. Alternatively, the mammal may be a laboratory, 
domestic, or agricultural animal. The bodily fluid or culture may be any bodily fluid or 
culture that is typically analyzed for the presence of bacteria. For example, the bodily 
fluid or culture may be a vaginal or rectovaginal culture. The bodily fluid or culture may 

10 also be a throat culture. The bodily fluid or culture may also be an endotracheal tube 
aspirant, fluid from a brochioaveloar lavage, or tissue from a lung biopsy. In certain 
embodiments, the bodily fluid or culture is blood, serum, amniotic fluid, cerebrospinal 
fluid, or joint fluid. Other sources of material will be apparent to those of skill in the art. 
In certain embodiments, a diagnostic method of the present invention comprises 

15 analyzing a sample for the presence or absence of the spbl and/or spb2 gene product(s). 

In certain embodiments of a diagnostic method of the present invention, the 
polymerase chain reaction ("PCR") is used to identify the presence or absence of the spbl 
and/or spb2 gene(s). In certain other embodiments, antibodies are used to identify the 
presence or absence of the spbl and/or spb2 gene products. The antibodies may be 

20 monoclonal or polyclonal antibodies. 

The present invention also relates to GBS vaccines. In certain embodiments, the 
present invention provides vaccines comprising the spbl gene product, i.e., a protein 
comprising the amino acid sequence of SEQ ID NO: 2. In certain other embodiments, a 
vaccine comprises the spb2 gene product, i.e., a protein comprising the amino acid 

25 sequence of SEQ ID NO: 4. In certain preferred embodiments, a vaccine comprises both 
the spbl and spb2 gene products. The vaccine may include an adjuvant, such as alum. In 
certain other embodiments, the spbl and/or spb2 gene(s) may be introduced into a 
mammal using either naked DNA or other gene therapy techniques to induce an immune 

response against type III GBS. 
30 The present invention further provides methods of immunizing a mammal against 

GBS infection. In certain embodiments, such methods comprise administering to the 
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mammal a vaccine comprising an immunologically effective amount of a recombinant ly 
produced protein comprising the amino acid sequence of SEQ ID NO: 2. In certain other 
embodiments, a method of the present invention comprises administering to the mammal 
a vaccine comprising an immunologically effective amount of a recombinantly produced 
protein comprising the amino acid sequence of SEQ ID NO: 4. The vaccine may also 
5 contain a mixture of the spbl and spb2 gene products. Vaccines used in the methods of 
the present invention may further comprise an adjuvant, such as alum. 

These and other features and advantages of the present invention will become 
more fully apparent from the following detailed description. 

10 4. SUMM ARY OF THE DRAWINGS 

Figure 1 illustrates a comparison of Hindlll and &e83871 RDP typing of 62 type 
III GBS isolates from Salt Lake City, Utah and Tokyo, Japan. Isolates were classified 
into types based on the similarity of the restriction digest patterns produced by Hindlll or 
&e83871 digestion of chromosomal DNA. The two methods divided isolates into RDP 

15 types containing exactly the same isolates: III-3 contains isolates 1-41, III-2 contains 
isolates 42-59, and III-l contains isolates 60-62. 

Figure 2 illustrates a dot blot hybridization of probe 1 with genomic DNA isolated 
from type III GBS. 10 jig of genomic DNA from each of 62 type III GBS strains was 
transferred to nylon membrane. Radiolabeled probe 1 hybridized with DNA from all III- 

20 3 strains (rows A - D) including the original type III-3 strain (well El). The probe failed 
to hybridize with DNA from III-2 strains (Fl - F10, Gl - G7) including the original strain 
used in the subtraction hybridization (well E10) and III-l strains (wells HI - H3). The 
same pattern of hybridization was observed using clone 3 and 1 1 probes. 

25 5. DETAILED DESCRIPTION OF THE INVENT ION 

The present invention relates to the identification and prevention of infections by 
virulent forms of GBS. The present invention also relates to isolated genes specific to 
type III-3 GBS. These genes, spbl and spb2, encode the spbl and spb2 gene products. 

The spbl and spb2 genes or other nuclceic acid moleucles coding for the spbl or 
30 spb2 gene products may be incorporated into a recombinant vector using methods known 
in the art. See, e.g., 1-3 J. Sambrook et al., Molecular Cloning: A Laboratory Manual 
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(2d ed. 1989). Recombinant vectors include any genetic element, such as a plasmid, 
phage, transposon, cosmid, chromosome, virus, etc., that is capable of replication when 
associated with the proper control elements and that can transfer gene sequences between 
cells. Thus, the term includes cloning and expression vehicles. 

A nucleic acid molecule of the present invention may be operably linked to 
5 expression control sequences, such as heterologous promoters. Examples include, but are 
not limited to, viral promoters such as the SV40 early promoter and the CMV immediate 
early promoter region, bacterial promoters, mammalian promoters, inducible promoters, 
synthetic promoters, hybrid promoters, and the like. Other expression control sequences 
are known in the art and include polyadenylation signals, transcription termination 

10 sequences, upstream regulatory domains, origins of replication, internal ribosome entry 
sites ("IRES"), and enhancers. These expression control sequences collectively provide 
for the replication, transcription and translation of a coding sequence in a recipient cell. 
Not all of these control sequences need always be present in a recombinant vector, so 
long as the selected coding sequence is capable of being replicated, transcribed and 

1 5 translated in an appropriate host cell. 

Recombinant vectors can be constructed to include selectable markers. Suitable 
markers include genes which confer antibiotic resistance or sensitivity, or impart color, or 
change the antigenic characteristics when host cells which have been transfected with the 
recombinant vectors are grown in an appropriate selective medium. Suitable markers are 

20 known to those of skill in the art. 

The discovery of type III-3 GBS-specific gene products will allow clinicians to 
diagnose and treat infection and colonization with virulent GBS. For example, 
hybridization-based assays may be used to determine whether a GBS isolate is type III-3. 
Figure 2 illustrates the results of hybridization assays with a III-3-specific probe. 

25 Likewise, PCR may be used to detect the presence or absence of either the spbl gene or 
the spb2 gene (or both) in samples from patients. PCR methods are described generally 
in C.R. Newton & A. Graham, PCR (2nd. ed. 1997); PCR: Essential Techniques (J.F. 
Burke ed., 1996). Patients who are infected with type III-3 GBS may then receive 
appropriate antibiotic therapy. 
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Antibodies may also be used to detect the presence or absence of the spbl and/or 
spb2 gene product(s). Methods for preparing both monoclonal and polyclonal antibodies 
are described in, e.g., E. Harlow & D. Lane, Antibodies: A Laboratory Manual (1988). 

The present invention also relates to methods for producing type III-3 GBS 
vaccines. See generally Vaccine Protocols (A. Robinson, G.H. Farrar & C.N. Wiblin eds. 
5 1996). In certain embodiments, the spbl and/or spb2 gene product(s) may be used to 

immunize against GBS. These gene products may be produced in large quantities using 
techniques that are known in the art. For example, the appropriate gene or genes may be 
linked to a prokaryotic promoter and expressed in bacteria. The gene products may then 
be purified using conventional techniques and used to vaccinate at-risk individuals. 

10 Alternatively, the appropriate gene or genes may be linked to a eukaryotic promoter and 
enhancer (e.g., yeast, baculo virus, SV40, etc.) and expressed in an appropriate cell type. 
The gene products may then be purified using conventional techniques. 

The spbl and/or spb2 gene products, or immunogenic fragments thereof, may 
stimulate an immune response when administered to a host. Recombinantly produced 

15 proteins are especially desirable, as they can be produced in large amounts and purified. 
Furthermore, recombinantly produced proteins may be engineered to maximize desirable 
activities and to minimize unwanted effects. 

The recombinantly produced spbl and/or spb2 gene products may be used as 
carrier proteins for a polysaccharide-protein or oligosaccharide-protein conjugate vaccine. 

20 See, e.g., R. Schneerson, et al., Infect Immun 60:3528-3532 (1992) (describing a 

Pneumococcus-pertussis toxin conjugate vaccine). For example, Haemophilus influenzae 
B vaccines have been produced by conjugating a tetanus toxoid; a Corynebacteriaum 
toxin, CRM 197 (which is a mutant diptheria toxin); and a Neisseria outer membrane 
protein. Oligo- and polysaccharides from GBS might be used in a vaccine. 

25 Oligosaccharide- and polysaccharide-protein conjugates alter the immunological 
properties of the polysaccharide or oligosaccharide and may improve the immune 
response. 

An adjuvant may be used to enhance the immune response to a vaccine containing 
the spbl and/or spb2 gene products. An adjuvant is any substance that enhances the 
30 immune response to an antigen. Without being bound by any particular theory, adjuvants 
may act by retaining the antigen locally near the site of administration to produce a depot 
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effect, facilitating the slow, sustained release of the antigen to cells of the immune 
system. Adjuvants may also attract cells of the immune system. Aluminum hydroxide 
and aluminum phosphate (collectively and commonly referred to as "alum") are routinely 
used as adjuvants in human and veterinary vaccines. Currently, alum is the only adjuvant 
licensed for human use, although a number of experimental adjuvants are being tested. 
5 The spbl and/or spb2 gene(s) may also be introduced into a mammal using either 

naked DNA or other gene therapy techniques to induce an immune response against 
virulent GBS. 

All publications, patents, and patent applications cited in this application are 
hereby incorporated by reference. U.S. Patent Application Serial No. 60/140,084 is 
10 hereby incorporated by reference in its entirety. 

6. EXA M PLES 

The following examples are given to illustrate several embodiments which have 
been made within the scope of the present invention. It is to be understood that these 

15 examples are neither comprehensive nor exhaustive of the many types of embodiments 
which can be prepared in accordance with the present invention. 

Example 1 - Isolation of the spbl and spb2 genes 
Bacterial factors that contribute to the increased virulence of III-3 strains can be 
identified by characterizing the differences between the genetic composition of III-3 and 

20 III-2 strains. Such genetic differences will be found in the bacterial chromosomes since 
these strains do not contain plasmids. Takahashi et al., supra. 

To identify genes present in virulent type III-3 GBS strains and not in the 
avirulent type III-2 strains, a modification of the technique described by Lisitsyn et al., 
Science 259:946-951 (1993), was used. High molecular weight genomic DNA from an 

25 invasive RDP type III-3 GBS strain (strain 874391) and a colonizing ("avirulent") RDP 
type III-2 strain (strain 865043) were prepared by cell lysis with mutanolysin and 
Proteinase K digestion. Y. Nagano et al., supra. For genetic subtraction, genomic DNA 
from both strains was digested with Tag I. Tag I-digested DNA from the virulent strain 
was mixed with two complementary oligonucleotides, TaqA (5'-CTAGGTGGA- 

30 TCCTTCGGC AAT-3 ' (SEQ ID NO: 5)) and TaqB (5'-CGATTGCCGA-3' (SEQ ID NO: 
6)), heated to 50° C for 5 minutes, then allowed to cool slowly to 16°C in T4 ligase 
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buffer. Oligonucleotides were ligated to the virulent strain DNA by incubation with 20 
units of T4 ligase at 16°C for 12 hours. After ligation, 500 ng of DNA from the virulent 
strain, with ligated linkers, and 40 \ig of DNA from the avirulent strain, without linkers, 
was mixed together, denatured by heating, and hybridized at 68 °C for 20 hours. 

Ten percent of the resulting hybridization mixture was incubated with Tag DNA 
5 polymerase and dNTPs to fill in the ends of annealed virulent strain DNA. The 

hybridized DNA was amplified by Tag DNA polymerase for 10 cycles using the TaqA 
oligonucleotide as the forward and reverse amplification primer. After amplification, 
single stranded products remaining after amplification were digested with mung bean 
nuclease. Twenty percent of the resulting product was then reamplified for 20 cycles. 

10 This process of subtraction followed by PCR amplification results in enhanced 

amplification of DNA segments from the III-3 strains that do not hybridize with DNA 
segments from the III-2 strains. 

A total of four cycles of subtraction and amplification were carried out using 
successively smaller quantities of III-3 specific PCR products. Two pairs of 

1 5 oligonucleotides were used for subtraction, which were alternated with successive rounds 
of subtraction-amplification. The oligos were TaqA (SEQ ID NO: 5) and TaqB (SEQ ID 
NO: 6) (the first pair) and TaqE (5 '-AGGC AACTGTGCTAACCGAGGGAAT-3 ' (SEQ 
ID NO: 7)) and TaqF (5'-CGATTCCCTCG-3' (SEQ ID NO: 8)) (the second pair). The 
final amplification products were ligated into pBS KS+ vector and transformed into 

20 competent XLl-Blue strain E. coli. Thirteen clones were randomly selected for analysis. 
Cross hybridization studies of the 13 inserts revealed that 6 were unique. These probes 
were used in slot and dot blot experiments to determine whether subtraction was 
successful and to identify probes hybridizing with all III-3 strains. Each of the 6 unique 
probes hybridized with the parental III-3 virulent strain, while none of the probes 

25 hybridized with the avirulent III-2 strains. Three of the amplified sequence tags (clones 
1, 3 and 11) hybridized with genomic DNA from all 62 type III-3 isolates, but did not 
hybridize with DNA prepared from the III-2 and III-l isolates. 

Figure 2 illustrates a dot blot hybridization of type III GBS genomic DNA 
hybridized with a clone 1 probe. 10 |ig of genomic DNA from each of 62 type III GBS 

30 strains was transferred to nylon membrane. Radiolabeled clone 1 probe hybridized with 
DNA from all III-3 strains (rows A-D) including the original type III-3 strain (well E-l). 
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The probe failed to hybridize with DNA from III-2 strains (F1-F10, G1-G7) including the 
original strain used in the subtraction hybridization (well E10) and III-l strains (wells Hl- 
H3). The same pattern of hybridiztion was observed using clone 3 and 1 1 probes. These 
data demonstrate the feasibility of identifying genes unique to III-3 strains by this method 
of PCR-based subtractive hybridization, and further support the validity of the RDP 
5 typing for identifying genetically-related type III GBS. 

The three GBS type III-3-specific sequence tags are short (130-360 bp). To obtain 
additional sequence information, a genomic GBS III-3 library was constructed. High 
molecular weight GBS genomic DNA was partially digested with Bgl II and cloned into X 
FIX II phage vector. Phage were packaged and the library, consisting of 1 .7 x 10 5 

10 recombinant phage containing inserts with a mean size of about 1 8 kb (totaling 

approximately 3 x 10 9 bp), was amplified once. Multiple plaques hybridizing with each 
of the III-3 GBS-specific probes were purified for further characterization. 

Three overlapping genomic clones hybridizing with probe 1 were identified, with 
approximate sizes of 9, 22, and 23 kb. Since the boundaries of GBS III-3 specific DNA 

15 are not known, smaller fragments were subcloned and the DNA was verified present in 
virulent GBS strains before proceeding with further characterization. The first segment 
examined is a 6.4 kb Sal l-Bg\ II fragment. This genomic DNA is present in all RDP type 
III-3 strains and in none of 17 RDP type III-2 strains. 

Over 90% of this genomic DNA fragment has been sequenced and found to 

20 contain 5 open reading frames ("ORFs"). Three of these are likely to be authentic genes. 
They feature ATG start sites, are preceded by potential ribosomal binding sites and, in 
two cases, stop codons are followed by palindromic sequences that may represent 
transcriptional terminators. They are relatively short, however, and do not show 
significant homology at the nucleic acid or amino acid level with sequences registered 

25 with Genbank or the available bacterial genome databases. 

The two other ORFs appear to be more obvious candidates for virulence genes. 
The spbl gene includes a 1509 bp ORF that is preceded by a potential ribosomal binding 
site 10 bases upstream from an ATG start codon. The predicted protein (502 amino acids 
and Mr 53,446) has the characteristics of a cell wall-bound protein. The N-terminus of 

30 the predicted protein is a hydrophilic, basic stretch of 6 amino acids followed by a 23 
amino acid hydrophobic, proline-rich core, consistent with a signal peptide. The 

10 
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hydrophilic mature protein terminates in a typical LPXTG (SEQ ID NO: 9) domain that 
immediately precedes a hydrophobic 20 amino acid core and a short, basic hydrophilic 
terminus. 

The nucleotide sequence is not homologous to sequences of other known bacterial 
genes. The translated amino acid sequence, however, shares segmental homology with a 
number of characterized proteins, including the fimbrial type 2 protein of Actinomyces 
naeslundii (27% identity over 350 amino acids) and the fimbrial type 1 protein of 
Actinomyces viscosus (25% homology over 420 amino acids), the T6 surface protein of S. 
pyogenes (23% identity over 359 amino acids), and the hsf (27% identity over 260 amino 
acids ) and HMW1 adhesins (25% identity over 285 amino acids) of Haemophilus 
influenzae. See M.K. Yeung & J.O. Cisar, J Bacteriol 172:242-2468 (1990); O. 
Schneewind, et ah, J Bacteriol 172:3310-3317 (1990); J.W. St. Geme III, et al., J 
Bacteriol 178:6281-6287 (1996); J.W. St. Geme III, Infect Immun 62: 3881-3889 (1994). 
The function of the S. pyogenes T6 protein is unknown. Each of the other homologues 
plays a role in bacterial adhesion and/or invasion. 

The second ORF, spb2, terminates 37 bp upstream from spbl and is in the same 
transcriptional orientation. This 1692 bp ORF has a deduced amino acid sequence of 563 
residues and Mr 64,492. It shares 50.5% nucleic acid identity and 20.7% amino acid 
identity with spbl. Conservation is highest in the carboxy-terminal regions, including a 
shared LPSTGG (SEQ ID NO: 10) motif. In contrast to spbl, spb2 does not have an 
obvious signal sequence. Its secretion may be mediated by carboxy-terminal recognition 
sequences or by accessory peptides. T. Michiels, et al., Infect Immun 58:2840-2849 
(1990). The deduced amino acid sequence of spb2 is also homologous with S. pyogenes 
T6 and Actinomyces naeslundii proteins, and to Listeria monocytogenes internalin A 
(22% identity over 308 amino acids) — again, proteins important in adhesion and invasion. 

Neither of the predicted gene products has the repetitive structure of previously 
described GBS surface proteins such as the C and Rib proteins or of L. monocytogenes 
internalin family members. L.C. Madoff et al., Infect Immun 59:2638-2644 (1991); J. 
Gaillard, et al., Cell 65: 1 127-1 141 (1991). Hybridization of the originating strain 
874391 genomic DNA with an spbl probe under low stringency conditions identifies a 
single band in EcoR I, Sal I and Sst I digests (data not shown), suggesting that a single 
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copy of spbl is present in this strain and that spbl is not a member of a significantly 
homologous "family" of genes. 

Example 2 - Adherence and Invasion Assays Using spbl' Bacteria 
Genomic subtraction identified a 1509 bp ORF, spbl, which is present in virulent 
RDP type III-3 GBS and not in RDP type III-2 strains. The predicted 53 kD protein 
product has the characteristics of a typical gram positive cell-wall bound protein. The 
nucleic acid sequence of spbl is not homologous to sequences of other known bacterial 
genes, however the translated amino acid sequence shares segmental homology with 
several characterized adhesins, including Actinomyces fimbrial proteins and H. influenzae 
HMW1, suggesting that Spbl might contribute to GBS adhesion or invasion. A spbl' 
isogenic deletion mutant GBS strain was created by homologous recombination and the 
ability of the spbl' mutant to adhere to and invade A549 respiratory epithelial cells was 
determined. Compared to the wild type strain, the number of spbl' bacteria adherent to 
A549 monolayers was reduced by 60.0% (p<0.01) and the number of intracellular 
invading bacteria was reduced by 53.6% (p<0.01). Without being bound by any 
particular theory, it appears that Spbl may contribute to the pathogenesis of GBS 
pneumonia and bacterial entry into the bloodstream. 
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CLAIMS: 

1. An isolated nucleic acid molecule comprising nucleotides which code for the 
amino acid sequence of SEQ ID NO: 2. 

2. A recombinant vector comprising the nucleic acid molecule of claim 1. 

3. The recombinant vector of claim 2, wherein said recombinant vector is a plasmid. 
5 4. The recombinant vector of claim 2, wherein said recombinant vector is a 

prokaryotic or eukaryotic expression vector. 

5. The recombinant vector of claim 2, wherein the nucleic acid molecule is operably 
linked to a heterologous promoter. 

6. A host cell comprising the nucleic acid molecule of claim 1. 

10 7. The host cell of claim 6, wherein the host cell is a eukaryotic host cell. 

8. The host cell of claim 6, wherein the host cell is a prokaryotic host cell. 

9. An isolated nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO: 1. 

10. A recombinant vector comprising the nucleic acid molecule of claim 9. 
15 11. The recombinant vector of claim 1 0, wherein said recombinant vector is a 

plasmid. 

12. The recombinant vector of claim 10, wherein said recombinant vector is a 
prokaryotic or eukaryotic expression vector. 

13. The recombinant vector of claim 10, wherein the nucleic acid molecule is 
20 operably linked to a heterologous promoter. 

14. A host cell comprising the nucleic acid molecule of claim 9. 

15. The host cell of claim 14, wherein the host cell is a eukaryotic host cell. 

16. The host cell of claim 14, wherein the host cell is a prokaryotic host cell. 

17. An isolated nucleic acid molecule comprising nucleotides which code for the 
25 amino acid sequence of SEQ ID NO: 4 

18. A recombinant vector comprising the nucleic acid molecule of claim 17. 

19. The recombinant vector of claim 1 8, wherein said recombinant vector is a 
plasmid. 

20. The recombinant vector of claim 1 8, wherein said recombinant vector is a 
30 prokaryotic or eukaryotic expression vector. 
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21. The recombinant vector of claim 18, wherein the nucleic acid molecule is 
operably linked to a heterologous promoter. 

22. A host cell comprising the nucleic acid molecule of claim 17. 

23. The host cell of claim 22, wherein the host cell is a eukaryotic host cell. 

24. The host cell of claim 22, wherein the host cell is a prokaryotic host cell. 

25. An isolated nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO: 3. 

26. A recombinant vector comprising the nucleic acid molecule of claim 25. 

27. The recombinant vector of claim 26, wherein said recombinant vector is a 
plasmid. 

28. The recombinant vector of claim 26, wherein said recombinant vector is a 
prokaryotic or eukaryotic expression vector. 

29. The recombinant vector of claim 26, wherein the nucleic acid molecule is 
operably linked to a heterologous promoter. 

30. A host cell comprising the nucleic acid molecule of claim 25. 

31. The host cell of claim 30, wherein the host cell is a eukaryotic host cell. 

32. The host cell of claim 30, wherein the host cell is a prokaryotic host cell. 

33. A method of immunizing a mammal against Group B streptococci infection, said 
method comprising administering to the mammal a vaccine comprising an 
immunologically effective amount of a recombinantly produced protein comprising the 
amino acid sequence of SEQ ID NO: 2. 

34. The method of claim 33, wherein the vaccine further comprises an adjuvant. 

35. The method of claim 34, wherein the adjuvant comprises alum. 

36. The method of claim 33, wherein the vaccine further comprises an 
immunologically effective amount of a recombinantly produced protein comprising the 
amino acid sequence of SEQ ID NO: 4. 

37. The method of claim 36, wherein the vaccine further comprises an adjuvant. 

38. A method of immunizing a mammal against Group B streptococci infection, said 
method comprising administering to the mammal a vaccine comprising an 
immunologically effective amount of a recombinantly produced protein comprising the 
amino acid sequence of SEQ ID NO: 4. 

39. The method of claim 38, wherein the vaccine further comprises an adjuvant. 
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40. The method of claim 39, wherein the adjuvant comprises alum. 

41 . A diagnostic method for determining whether a mammal is infected or colonized 
by virulent Group B streptococci (GBS), said method comprising the steps of: 

(a) collecting a bodily fluid or culture from the mammal; 

(b) analyzing the bodily fluid or culture for the presence or absence of one or 
5 more gene products specific to type III-3 GBS; 

wherein the presence of one or more gene products specific to type III-3 GBS indicates 
infection or colonization by virulent GBS. 

42. The diagnostic method of claim 41, wherein the one or more gene products 
specific to type III-3 GBS comprise a protein, said protein comprising the amino acid 

1 0 sequence of SEQ ID NO: 2. 

43. The diagnostic method of claim 41 , wherein the one or more gene products 
specific to type III-3 GBS comprise a protein, said protein comprising the amino acid 
sequence of SEQ ID NO: 4. 

44. The diagnostic method of claim 41, wherein the one or more gene products 

15 specific to type III-3 GBS comprise a first protein comprising the amino acid sequence of 
SEQ ID NO: 2 and a second protein comprising the amino acid sequence of SEQ ID 
NO: 4. 

45. The diagnostic method of claim 41, wherein the mammal is a human. 

46. The diagnostic method of claim 41, wherein the bodily fluid or culture is a vaginal 

20 or rectovaginal culture. 

47. The diagnostic method of claim 41, wherein the bodily fluid or culture is a throat 

culture. 

48. The diagnostic method of claim 41, wherein the bodily fluid or culture is blood, 
serum, amniotic fluid, cerebrospinal fluid, or joint fluid. 

25 49. The diagnostic method of claim 41, wherein the analysis step comprises using 

polymerase chain reaction (PCR) to identify the presence or absence of one or more gene 
products specific to type III-3 GBS. 

50. The diagnostic method of claim 49, wherein the analysis step comprises using 
PCR to determine the presence or absence of the spbl gene product. 
30 51. The diagnostic method of claim 49, wherein the analysis step comprises using 
PCR to determine the presence or absence of the spb2 gene product. 

15 
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52. The diagnostic method of claim 41, wherein the analysis step comprises using 
antibodies to identify the presence or absence of one or more gene products specific to 
type III-3 GBS. 

53. The diagnostic method of claim 52, wherein the antibodies are monoclonal 
antibodies, 

54. The diagnostic method of claim 53, wherein the monoclonal antibodies are 
specific for the spbl gene product. 

55. The diagnostic method of claim 53, wherein the monoclonal antibodies are 
specific for the spb2 gene product. 

56. An isolated and purified protein comprising the amino acid sequence of SEQ ID 
NO: 2. 

57. A vaccine for immunizing a mammalian host against virulent Group B 
streptococci infection, said vaccine comprising the protein of claim 56. 

58. The vaccine of claim 57, further comprising an adjuvant. 

59. The vaccine of claim 58, wherein the adjuvant comprises alum. 

60. The vaccine of claim 57, wherein the protein is conjugated to a bacterial 
polysaccharide or oligosaccharide. 

61 . An isolated and purified protein comprising the amino acid sequence of SEQ ID 
NO: 4. 

62. A vaccine for immunizing a mammalian host against virulent Group B 
streptococci infection, said vaccine comprising the protein of claim 61. 

63. The vaccine of claim 62, further comprising an adjuvant. 

64. The vaccine of claim 63, wherein the adjuvant comprises alum. 

65. The vaccine of claim 62, further comprising an isolated and purified protein 
comprising the amino acid sequence of SEQ ID NO: 2. 

66. The vaccine of claim 65, further comprising an adjuvant. 

67. The vaccine of claim 62, wherein the protein is conjugated to a bacterial 
polysaccharide or oligosaccharide. 
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<110> Adderson, Elisabeth E. 
Bohnsack, John F. 

University of Utah Research Foundation 

<120> Isolated Genes from Virulent Group B Streptococcus 
agalactiae 

<130> 1321.2.29 

<140> 
<141> 

<150> 60/140,084 

<151> 1999-06-21 

<160> 10 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1509 
<212> DNA 

<213> Streptococcus agalactiae 

<220> 
<221> CDS 
<222> (1) . . (1509) 

<400> 1 

atg aaa aag aaa atg att caa teg 
Met Lys Lys Lys Met lie Gin Ser 
1 5 

ggt atg get gta tea cca gtt acg 
Gly Met Ala Val Ser Pro Val Thr 

20 

ggg aca att aca gtt caa gat act 
Gly Thr lie Thr Val Gin Asp Thr 

35 40 

tat aaa gtt ttt gat gca gaa ata 
Tyr Lys Val Phe Asp Ala Glu lie 
50 55 

aat aaa gat gga get tct tat tta 



ctg tta gtg gcg agt tta gca ttt 48 
Leu Leu Val Ala Ser Leu Ala Phe 
10 15 

ccg ata get ttt gee get gag aca 96 
Pro lie Ala Phe Ala Ala Glu Thr 
25 30 

caa aaa ggc gca ace tat aaa gca 144 
Gin Lys Gly Ala Thr Tyr Lys Ala 

45 

gat aat gca aat gta tct gat teg 192 
Asp Asn Ala Asn Val Ser Asp Ser 

60 

att cct caa ggt aaa gaa get gag 240 



1 
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Asn Lys Asp Gly Ala Ser Tyr Leu lie Pro Gin Gly Lys Glu Ala Glu 

65 70 75 80 

tat aaa get tea act gat ttt aat tct ctt ttt acg aca act act aat 288 

Tyr Lys Ala Ser Thr Asp Phe Asn Ser Leu Phe Thr Thr Thr Thr Asn 

85 90 95 

gga ggg aga aca tat gta act aaa aaa gat act gcg tea gca aat gag 336 

Gly Gly Arg Thr Tyr Val Thr Lys Lys Asp Thr Ala Ser Ala Asn Glu 

100 105 110 



att gcg aca tgg get aaa tct ata tea get aat act aca cca gtt tec 
lie Ala Thr Trp Ala Lys Ser lie Ser Ala Asn Thr Thr Pro Val Ser 
115 120 125 

act gtt act gag tea aat aat gat ggt act gag gtt att aat gtt tec 
Thr Val Thr Glu Ser Asn Asn Asp Gly Thr Glu Val lie Asn Val Ser 
130 135 140 

caa tat gga tat tat tat gtt tct age act gtt aat aat gga get gta 
Gin Tyr Gly Tyr Tyr Tyr Val Ser Ser Thr Val Asn Asn Gly Ala Val 
145 150 155 160 

att atg gtt aca tct gta act cca aat get act att cat gaa aag aat 
lie Met Val Thr Ser Val Thr Pro Asn Ala Thr lie His Glu Lys Asn 

165 170 175 

act gat gcg aca tgg gga gat ggt ggt gga aaa act gta gat caa aaa 
Thr Asp Ala Thr Trp Gly Asp Gly Gly Gly Lys Thr Val Asp Gin Lys 

180 185 190 

acg tac teg gtt ggt gat aca gtc aaa tat act att act tat aag aat 
Thr Tyr Ser Val Gly Asp Thr Val Lys Tyr Thr lie Thr Tyr Lys Asn 
195 200 205 

gca gtc aat tat cat ggt aca gaa aaa gtg tat caa tat gtt ata aag 
Ala Val Asn Tyr His Gly Thr Glu Lys Val Tyr Gin Tyr Val lie Lys 
210 215 220 

gat act atg cca tct get tct gta gtt gat ttg aac gaa ggg tct tat 
Asp Thr Met Pro Ser Ala Ser Val Val Asp Leu Asn Glu Gly Ser Tyr 
225 230 235 240 

gaa gta act att act gat gga tea ggg aat att aca act eta act caa 
Glu Val Thr lie Thr Asp Gly Ser Gly Asn lie Thr Thr Leu Thr Gin 

245 250 255 

ggt teg gaa aaa gca act ggg aag tat aac ctg tta gag gaa aat aat 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 
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Gly Ser Glu Lys Ala Thr Gly Lys Tyr Asn Leu Leu Glu Glu Asn Asn 

260 265 270 

aat ttc acg att act att ccg tgg gca get acc aat act cca acc gga 864 

Asn Phe Thr lie Thr lie Pro Trp Ala Ala Thr Asn Thr Pro Thr Gly 
275 280 285 

aat act caa aat gga get aat gat gac ttt ttt tat aag gga ata aat 912 

Asn Thr Gin Asn Gly Ala Asn Asp Asp Phe Phe Tyr Lys Gly lie Asn 
290 295 300 

aca ate aca gtc act tat aca gga gta tta aag agt gga get aaa cca 960 

Thr lie Thr Val Thr Tyr Thr Gly Val Leu Lys Ser Gly Ala Lys Pro 
305 310 315 320 

ggt tea get gat tta cca gaa aat aca aac att gcg acc ate aac ccc 1008 

Gly Ser Ala Asp Leu Pro Glu Asn Thr Asn lie Ala Thr lie Asn Pro 

325 330 335 

aat act age aat gat gac cca ggt caa aaa gta aca gtg agg gat ggt 1056 

Asn Thr Ser Asn Asp Asp Pro Gly Gin Lys Val Thr Val Arg Asp Gly 

340 345 350 

caa att act ata aaa aaa att gat ggt tec aca aaa get tea tta caa 1104 

Gin lie Thr lie Lys Lys lie Asp Gly Ser Thr Lys Ala Ser Leu Gin 
355 360 365 

ggt get ata ttt gtt tta aag aat get acg ggt caa ttt eta aac ttt 1152 

Gly Ala lie Phe Val Leu Lys Asn Ala Thr Gly Gin Phe Leu Asn Phe 
370 375 380 

aac gat aca aat aac gtt gaa tgg ggc aca gaa get aat gca aca gaa 1200 

Asn Asp Thr Asn Asn Val Glu Trp Gly Thr Glu Ala Asn Ala Thr Glu 
385 390 395 400 

tat aca aca gga gca gat ggt ata att acc att aca ggc ttg aaa gaa 1248 

Tyr Thr Thr Gly Ala Asp Gly lie lie Thr lie Thr Gly Leu Lys Glu 

405 410 415 



ggt aca tac tat eta gtt gag aaa aag get ccc tta ggt tac aat ttg 
Gly Thr Tyr Tyr Leu Val Glu Lys Lys Ala Pro Leu Gly Tyr Asn Leu 

420 425 430 

tta gat aac tct cag aag gtt att tta gga gat gga gee act gat acg 
Leu Asp Asn Ser Gin Lys Val lie Leu Gly Asp Gly Ala Thr Asp Thr 
435 440 445 

act aat tea gat aac ctt tta gtt aac cca act gtt gaa aat aac aaa 



1296 



1344 



1392 
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Thr Asn Ser Asp Asn Leu Leu Val Asn Pro Thr Val Glu Asn Asn Lys 
450 455 460 



ggt act gag ttg cct tea aca ggt ggt att ggt aca aca att ttc tac 
Gly Thr Glu Leu Pro Ser Thr Gly Gly lie Gly Thr Thr lie Phe Tyr 
465 470 475 480 

att ata ggt gca att tta gta ata gga gca ggt ate gtg ctt gtt get 
lie lie Gly Ala lie Leu Val lie Gly Ala Gly lie Val Leu Val Ala 

485 490 495 

cgt cgt cgt tta cgt tct taa 
Arg Arg Arg Leu Arg Ser 

500 



1440 



1488 



1509 



<210> 2 
<211> 502 
<212> PRT 

<213> Streptococcus agalactiae 



<400> 2 

Met Lys Lys Lys 
1 

Gly Met Ala Val 

20 

Gly Thr lie Thr 

35 

Tyr Lys Val Phe 
50 

Asn Lys Asp Gly 
65 

Tyr Lys Ala Ser 

Gly Gly Arg Thr 

100 

lie Ala Thr Trp 
115 

Thr Val Thr Glu 
130 

Gin Tyr Gly Tyr 
145 

lie Met Val Thr 

Thr Asp Ala Thr 

180 

Thr Tyr Ser Val 
195 



Met lie Gin Ser 
5 

Ser Pro Val Thr 

Val Gin Asp Thr 

40 

Asp Ala Glu lie 

55 

Ala Ser Tyr Leu 
70 

Thr Asp Phe Asn 
85 

Tyr Val Thr Lys 

Ala Lys Ser lie 

120 

Ser Asn Asn Asp 
135 

Tyr Tyr Val Ser 
150 

Ser Val Thr Pro 
165 

Trp Gly Asp Gly 

Gly Asp Thr Val 

200 



Leu Leu Val Ala 
10 

Pro lie Ala Phe 
25 

Gin Lys Gly Ala 

Asp Asn Ala Asn 

60 

lie Pro Gin Gly 

75 

Ser Leu Phe Thr 
90 

Lys Asp Thr Ala 
105 

Ser Ala Asn Thr 

Gly Thr Glu Val 

140 

Ser Thr Val Asn 
155 

Asn Ala Thr lie 
170 

Gly Gly Lys Thr 
185 

Lys Tyr Thr lie 



Ser Leu Ala Phe 

15 

Ala Ala Glu Thr 
30 

Thr Tyr Lys Ala 
45 

Val Ser Asp Ser 

Lys Glu Ala Glu 

80 

Thr Thr Thr Asn 

95 

Ser Ala Asn Glu 
110 

Thr Pro Val Ser 
125 

lie Asn Val Ser 

Asn Gly Ala Val 

160 

His Glu Lys Asn 
175 

Val Asp Gin Lys 
190 

Thr Tyr Lys Asn 
205 
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Ala Val Asn Tyr His Gly Thr Glu Lys Val Tyr Gin Tyr Val He Lys 

210 215 220 

Asp Thr Met Pro Ser Ala Ser Val Val Asp Leu Asn Glu Gly Ser Tyr 
225 230 235 240 

Glu Val Thr He Thr Asp Gly Ser Gly Asn He Thr Thr Leu Thr Gin 

245 250 255 

Gly Ser Glu Lys Ala Thr Gly Lys Tyr Asn Leu Leu Glu Glu Asn Asn 

260 265 270 

Asn Phe Thr He Thr He Pro Trp Ala Ala Thr Asn Thr Pro Thr Gly 

275 280 285 

Asn Thr Gin Asn Gly Ala Asn Asp Asp Phe Phe Tyr Lys Gly He Asn 

290 295 300 

Thr lie Thr Val Thr Tyr Thr Gly Val Leu Lys Ser Gly Ala Lys Pro 
305 310 315 320 

Gly Ser Ala Asp Leu Pro Glu Asn Thr Asn He Ala Thr He Asn Pro 

325 330 335 

Asn Thr Ser Asn Asp Asp Pro Gly Gin Lys Val Thr Val Arg Asp Gly 

340 345 350 

Gin He Thr He Lys Lys He Asp Gly Ser Thr Lys Ala Ser Leu Gin 

355 360 365 

Gly Ala He Phe Val Leu Lys Asn Ala Thr Gly Gin Phe Leu Asn Phe 

370 375 380 

Asn Asp Thr Asn Asn Val Glu Trp Gly Thr Glu Ala Asn Ala Thr Glu 
385 3 *0 395 400 

Tyr Thr Thr Gly Ala Asp Gly He He Thr He Thr Gly Leu Lys Glu 

405 410 415 

Gly Thr Tyr Tyr Leu Val Glu Lys Lys Ala Pro Leu Gly Tyr Asn Leu 

420 425 430 

Leu Asp Asn Ser Gin Lys Val He Leu Gly Asp Gly Ala Thr Asp Thr 

435 440 445 

Thr Asn Ser Asp Asn Leu Leu Val Asn Pro Thr Val Glu Asn Asn Lys 

450 455 460 

Gly Thr Glu Leu Pro Ser Thr Gly Gly He Gly Thr Thr He Phe Tyr 
465 4 70 475 480 

He He Gly Ala He Leu Val He Gly Ala Gly He Val Leu Val Ala 

48 5 490 495 

Arg Arg Arg Leu Arg Ser 

500 



PCT/US00/17082 



<210> 3 
<211> 1692 
<212> DNA 

<213> Streptococcus agalactiae 

<220> 
<221> CDS 
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<222> (1) . . (1692) 
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<400> 3 

att tgc att atg gtg ate gta ttc egg att ata cag ata tta caa ggg 48 
lie Cys lie Met Val lie Val Phe Arg lie lie Gin lie Leu Gin Gly 
1 5 10 15 

att ata tec aag ate ctt cag gta cat att att ata agt atg att cac 96 
lie He Ser Lys He Leu Gin Val His He He He Ser Met He His 

20 25 30 

gag ata aag ate ccg act caa eta aag atg cct att ata cga cag ata 144 
Glu He Lys He Pro Thr Gin Leu Lys Met Pro He He Arg Gin He 

35 40 45 

eta gtc tea tea aat gtt gat aca aca act aag tac aag tac gta aaa 192 
Leu Val Ser Ser Asn Val Asp Thr Thr Thr Lys Tyr Lys Tyr Val Lys 
50 55 60 

gac get tac aaa tta gtc ggt tgg tat tat gtt aat cca tat ggt agt 240 
Asp Ala Tyr Lys Leu Val Gly Trp Tyr Tyr Val Asn Pro Tyr Gly Ser 
65 70 75 80 

att aga cct tat aac ttt tea ggt get gta act caa gat ate aat tta 288 
He Arg Pro Tyr Asn Phe Ser Gly Ala Val Thr Gin Asp He Asn Leu 

85 90 95 

aga get att tgg cga aag get gga gat tat cat att ata tac age aat 336 
Arg Ala He Trp Arg Lys Ala Gly Asp Tyr His He He Tyr Ser Asn 

100 105 110 

gat get gtt ggt aca gat gga aag cca gca ttg gat get tct ggt cag 3 84 
Asp Ala Val Gly Thr Asp Gly Lys Pro Ala Leu Asp Ala Ser Gly Gin 
115 120 125 

caa tta caa aca agt aat gag cct act gac cct gat tec tat gac gat 432 
Gin Leu Gin Thr Ser Asn Glu Pro Thr Asp Pro Asp Ser Tyr Asp Asp 
130 135 140 

ggc tec cat tea gee tta ctg aga cgt ccg aca atg cca gat ggc tat 480 
Gly Ser His Ser Ala Leu Leu Arg Arg Pro Thr Met Pro Asp Gly Tyr 
145 150 155 160 

cgt ttc cgt ggc tgg tgg tac aat ggt aaa att tat aac cca tat gat 528 
Arg Phe Arg Gly Trp Trp Tyr Asn Gly Lys He Tyr Asn Pro Tyr Asp 

165 170 175 

tec att gat att gac gee cat tta gca gat get aat aaa aat ate ace 576 
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Ser lie Asp lie Asp Ala His Leu Ala Asp Ala Asn Lys Asn lie Thr 

180 185 190 

ata aaa cct gtc att att cca gta gga gat ate aaa tta gaa gat acc 624 
lie Lys Pro Val lie lie Pro Val Gly Asp lie Lys Leu Glu Asp Thr 
195 200 205 

tec ate aaa tac aat ggt aac ggt ggt act aga gta gaa aat ggt aat 672 
Ser lie Lys Tyr Asn Gly Asn Gly Gly Thr Arg Val Glu Asn Gly Asn 
210 215 220 

gtg gta aca caa gtg gag aca ccg cgt atg gag ttg aat age aca act 72 0 
Val Val Thr Gin Val Glu Thr Pro Arg Met Glu Leu Asn Ser Thr Thr 
225 230 235 240 

aca att cct gaa aac caa tac ttt aca agg aca ggt tac aac ctt att 768 
Thr lie Pro Glu Asn Gin Tyr Phe Thr Arg Thr Gly Tyr Asn Leu lie 

245 250 255 

ggt tgg cat cat gat aag gat tta get gat aca gga cgt gtg gaa ttt 816 
Gly Trp His His Asp Lys Asp Leu Ala Asp Thr Gly Arg Val Glu Phe 

260 265 270 

aca gca ggt caa tea ata ggt att gat aac aac ctt gat gca aca aat 864 
Thr Ala Gly Gin Ser lie Gly lie Asp Asn Asn Leu Asp Ala Thr Asn 
275 280 285 

acc tta tat get gtt tgg caa cct aaa gaa tac acc gtc gga gta agt 912 
Thr Leu Tyr Ala Val Trp Gin Pro Lys Glu Tyr Thr Val Gly Val Ser 
290 295 300 

aaa act gtc gtt gga eta gat gaa gat aag acg aaa gac ttc ttg ttt 960 
Lys Thr Val Val Gly Leu Asp Glu Asp Lys Thr Lys Asp Phe Leu Phe 
305 310 315 320 

aat cca agt gaa acg ttg caa caa gag aat ttt ccg ctg aga gat ggt 1008 
Asn Pro Ser Glu Thr Leu Gin Gin Glu Asn Phe Pro Leu Arg Asp Gly 

325 330 335 

cag act aag gaa ttt aaa gta cct tat gga act tct ata tea ata gat 1056 
Gin Thr Lys Glu Phe Lys Val Pro Tyr Gly Thr Ser lie Ser lie Asp 

340 345 350 

gaa caa gee tac gat gaa ttt aaa gta tct gag tea att aca gaa aaa 1104 
Glu Gin Ala Tyr Asp Glu Phe Lys Val Ser Glu Ser lie Thr Glu Lys 
355 360 365 

aat eta gca act ggt gaa get gat aaa act tat gat get acc ggc tta 1152 
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Asn Leu Ala Thr Gly Glu Ala Asp Lys Thr Tyr Asp Ala Thr Gly Leu 
370 375 380 



aca gaa ttg cct agt act gga ggt att gga aca cac att tat ate eta 
Thr Glu Leu Pro Ser Thr Gly Gly lie Gly Thr His lie Tyr lie Leu 
530 535 540 

gtt ggt tta get tta get eta cca tea gga tta ata tta tac tat cga 
Val Gly Leu Ala Leu Ala Leu Pro Ser Gly Leu lie Leu Tyr Tyr Arg 
545 550 555 560 

aaa aaa ata tga 



1248 



1296 



caa tec ctg aca gtt tea gga gac gta gat att age ttt ace aat aca 12 00 
Gin Ser Leu Thr Val Ser Gly Asp Val Asp lie Ser Phe Thr Asn Thr 
385 390 395 400 

cgt ate aag caa aaa gta cga eta cag aaa gtt aat gtc gaa aat gat 
Arg lie Lys Gin Lys Val Arg Leu Gin Lys Val Asn Val Glu Asn Asp 

405 410 415 

aat aat ttt tta gca ggt gca gtt ttt gat att tat gaa tea gat get 
Asn Asn Phe Leu Ala Gly Ala Val Phe Asp lie Tyr Glu Ser Asp Ala 

420 425 430 

aat ggg aat aaa get tea cat cct atg tat tea ggg ctg gtg aca aac 1344 
Asn Gly Asn Lys Ala Ser His Pro Met Tyr Ser Gly Leu Val Thr Asn 
435 440 445 

gat aaa ggc ttg tta tta gtg gat get aat aac tac etc agt ttg cca 1392 
Asp Lys Gly Leu Leu Leu Val Asp Ala Asn Asn Tyr Leu Ser Leu Pro 
450 455 460 

gta gga aaa tac tac eta aca gag aca aag gee cct cca ggg tac eta 
Val Gly Lys Tyr Tyr Leu Thr Glu Thr Lys Ala Pro Pro Gly Tyr Leu 
465 470 475 480 

eta cct aaa aat gat gat ata tea gta tta gtg att tct acg gga gtt 
Leu Pro Lys Asn Asp Asp He Ser Val Leu Val He Ser Thr Gly Val 

485 490 495 

acc ttt gaa caa aat ggt aat aat gcg aca cca ata aaa gag aat tta 
Thr Phe Glu Gin Asn Gly Asn Asn Ala Thr Pro He Lys Glu Asn Leu 

500 505 510 

gtg gat gga agt aca gta tat act ttt aaa att act aac agt aaa gga 15 84 
Val Asp Gly Ser Thr Val Tyr Thr Phe Lys He Thr Asn Ser Lys Gly 
515 520 525 



1440 



1488 



1536 



1632 



1680 



1692 
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<210> 4 
<211> 563 
<212> PRT 

<213> Streptococcus agalactiae 



<400> 4 

lie Cys lie Met Val lie Val Phe Arg lie lie Gin lie Leu Gin Gly 

15 10 15 

lie lie Ser Lys lie Leu Gin Val His lie lie lie Ser Met lie His 

20 25 30 

Glu lie Lys lie Pro Thr Gin Leu Lys Met Pro lie lie Arg Gin lie 

35 40 45 

Leu Val Ser Ser Asn Val Asp Thr Thr Thr Lys Tyr Lys Tyr Val Lys 

50 55 60 

Asp Ala Tyr Lys Leu Val Gly Trp Tyr Tyr Val Asn Pro Tyr Gly Ser 
65 70 75 80 

lie Arg Pro Tyr Asn Phe Ser Gly Ala Val Thr Gin Asp lie Asn Leu 

85 90 95 

Arg Ala lie Trp Arg Lys Ala Gly Asp Tyr His lie lie Tyr Ser Asn 

100 105 110 

Asp Ala Val Gly Thr Asp Gly Lys Pro Ala Leu Asp Ala Ser Gly Gin 

115 120 125 

Gin Leu Gin Thr Ser Asn Glu Pro Thr Asp Pro Asp Ser Tyr Asp Asp 

130 135 140 

Gly Ser His Ser Ala Leu Leu Arg Arg Pro Thr Met Pro Asp Gly Tyr 
145 150 155 160 

Arg Phe Arg Gly Trp Trp Tyr Asn Gly Lys lie Tyr Asn Pro Tyr Asp 

165 170 175 

Ser lie Asp lie Asp Ala His Leu Ala Asp Ala Asn Lys Asn lie Thr 

180 185 190 

lie Lys Pro Val lie lie Pro Val Gly Asp lie Lys Leu Glu Asp Thr 

195 200 205 

Ser lie Lys Tyr Asn Gly Asn Gly Gly Thr Arg Val Glu Asn Gly Asn 

210 215 220 

Val Val Thr Gin Val Glu Thr Pro Arg Met Glu Leu Asn Ser Thr Thr 
225 230 235 240 

Thr lie Pro Glu Asn Gin Tyr Phe Thr Arg Thr Gly Tyr Asn Leu lie 

245 250 255 

Gly Trp His His Asp Lys Asp Leu Ala Asp Thr Gly Arg Val Glu Phe 

260 265 270 

Thr Ala Gly Gin Ser lie Gly lie Asp Asn Asn Leu Asp Ala Thr Asn 

275 280 285 

Thr Leu Tyr Ala Val Trp Gin Pro Lys Glu Tyr Thr Val Gly Val Ser 

290 295 300 

Lys Thr Val Val Gly Leu Asp Glu Asp Lys Thr Lys Asp Phe Leu Phe 
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305 310 315 320 

Asn Pro Ser Glu Thr Leu Gin Gin Glu Asn Phe Pro Leu Arg Asp Gly 

325 330 335 

Gin Thr Lys Glu Phe Lys Val Pro Tyr Gly Thr Ser lie Ser lie Asp 

340 345 350 

Glu Gin Ala Tyr Asp Glu Phe Lys Val Ser Glu Ser lie Thr Glu Lys 

355 360 365 

Asn Leu Ala Thr Gly Glu Ala Asp Lys Thr Tyr Asp Ala Thr Gly Leu 

370 375 380 

Gin Ser Leu Thr Val Ser Gly Asp Val Asp lie Ser Phe Thr Asn Thr 
385 390 395 400 

Arg lie Lys Gin Lys Val Arg Leu Gin Lys Val Asn Val Glu Asn Asp 

405 410 415 

Asn Asn Phe Leu Ala Gly Ala Val Phe Asp lie Tyr Glu Ser Asp Ala 

420 425 430 

Asn Gly Asn Lys Ala Ser His Pro Met Tyr Ser Gly Leu Val Thr Asn 

435 440 445 

Asp Lys Gly Leu Leu Leu Val Asp Ala Asn Asn Tyr Leu Ser Leu Pro 

450 455 460 

Val Gly Lys Tyr Tyr Leu Thr Glu Thr Lys Ala Pro Pro Gly Tyr Leu 
465 470 475 480 

Leu Pro Lys Asn Asp Asp lie Ser Val Leu Val lie Ser Thr Gly Val 

485 490 495 

Thr Phe Glu Gin Asn Gly Asn Asn Ala Thr Pro lie Lys Glu Asn Leu 

500 505 510 

Val Asp Gly Ser Thr Val Tyr Thr Phe Lys lie Thr Asn Ser Lys Gly 

515 520 525 

Thr Glu Leu Pro Ser Thr Gly Gly lie Gly Thr His lie Tyr lie Leu 

530 535 540 

Val Gly Leu Ala Leu Ala Leu Pro Ser Gly Leu lie Leu Tyr Tyr Arg 
545 550 555 560 

Lys Lys lie 



<210> 5 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
oligonucleotide 

<400> 5 

ctaggtggat ccttcggcaa t 



10 



WO 00/78787 PCT/USOO/17082 

<210> 6 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
oligonucleotide 

<400> 6 

cgattgccga 10 



<210> 7 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
oligonucleotide 

<400> 7 

aggcaactgt gctaaccgag ggaat 25 



<210> 8 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
oligonucleotide 

<400> 8 

cgattccctc g 11 



<210> 9 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Consensus 
sequence 
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<400> 9 

Leu Pro Xaa Thr Gly 
1 5 



<210> 10 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Consensus 
sequence 

<400> 10 

Leu Pro Ser Thr Gly Gly 
1 5 
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Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet. 



1. I x| A* au " required additional search fees were timely paid by the applicant, this international search report covers all searchable 

claims. 

2. | [ As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 

3. | | As only some of the required additional search fees were timely paid by the applicant, this international search report covers 

only those claims for which fees were paid, specifically claims Nos.: 



4. [ | No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest | x| The additional search fees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search fees. 
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International application No. 
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A. CLASSIFICATION OF SUBJECT MATTER: 
IPC (7): 

C07H 21/04; C07K 1/00, A61K 39/09, 39/38, 39/39, 38/385; C12N 15/00, 15/09; C12P 21/04, G01N 33/53, 33/569 

A. CLASSIFICATION OF SUBJECT MATTER: 
US CL : 

536/23.7; 530/350, 825; 424/244.1, 184.1, 236.1, 234.1, 193.1; 435/320.1, 69.3, 71.1, 7.1, 7.34 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

Group I, claim(s) 1-16, 33-35 and 56-60, drawn to an isolated nucleic acid molecule encoding SEQ ID NO: 2, a vector 
and a host cell comprising the same and a method of immunizing a mammal by administering a protein comprising SEQ 
ID NO: 2. 

Group II, claim(s) 17-32, 38-40, 61-64 and 67, drawn to an isolated nucleic acid molecule encoding SEQ ID NO: 4, a 
vector and a host cell comprising the same and a method of immunizing a mammal by administering a protein 
comprising SEQ ID NO: 4. 

Group III, claim(s) 42, 50 and 54, drawn to a diagnostic method comprising analyzing for the presence or absence of 
SEQ ID NO: 2. 

Group IV, claims 43, 51 and 55, drawn to a diagnostic method comprising analyzing for the presence or absence of 
SEQ ID NO: 4. 

Claims 36, 37, 65 and 66 are considered linking claims and would be joined with Group I or II, if elected. 
Claims 41, 44-49, 52 and 53 are considered linking claims and would be joined with Group II or III, if 

elected. 



The inventions listed as Groups I-IV do not relate to a single inventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: 
Group I is directed to a nucleic acid molecule of SEQ ID NO:l, a protein with amino acid sequence of SEQ ID NO: 2, 
a vector and a host cell comprising the nucleic acid molecule and the first method of use of the protein in immunization. 
This is a permitted category under PCT Rule 13.2. Group II is directed to a second product, a nucleic acid molecule of 
SEQ ID NO:3, protein with amino acid sequence of SEQ ID NO: 4, a vector and a host cell comprising the nucleic 
acid molecule and a method of immunizing a mammal by administering the protein. Groups III and IV are directed 
respectively to a second method of use of the gene products, i.e., diagnostic methods for analyzing spbl and spb2 gene 
products respectively. The SEQ ID NO.s of inventions I and II are the special technical features, which distinct from 
one another in their structure and specificity. The inventions therefore lack unity because the special technical feature is 
not a unifying feature. 
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